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Antarctic Treaty is cold comfort 


Researchers need to cement the bond between science and the South Pole if the region is to remain 


one of peace and collaboration. 


almost exactly a century ago, in the diary of doomed Antarctic 

explorer Robert Scott, those words mark the moment at which 
the British naval officer realized that the game was up. He had lost the 
race to the South Pole to his Norwegian rival Roald Amundsen, and 
the return journey was to prove worse than Scott imagined. By the end 
of March 1912, Scott and his two remaining companions had perished, 
just 18 kilometres from life-saving supplies. Their ill-fated expedition 
had a scientific slant too, and the zoologist Edward Wilson, who died 
alongside Scott, was the first scientist to see the South Pole. 

Itis perhaps a fitting legacy that the hostile landscape of Wilson's final 
months is now the scene of unprecedented scientific cooperation. Amid 
conflicting territorial disputes, the 1959 Antarctic Treaty handed the 
continent to the world’s researchers, with the explicit goal of ensuring 
“in the interests of all mankind that Antarctica shall continue forever 
to be used exclusively for peaceful purposes and shall not become the 
scene or object of international discord”. Some 30 countries now oper- 
ate research bases in Antarctica, and the Iranian Students’ News Agency 
reported last week that Iran intends to open one within three years. 

The international rush to Antarctica in the name of science has not 
gone unnoticed by some with interests beyond research. In an article 
published in The Australian newspaper on 31 December, Sam Bate- 
man and Anthony Bergin of the Australian Strategic Policy Institute 
in Canberra raised the provocative prospect that countries such as 
China and India could use bases recently opened there to improve 
satellite communications to military forces that increasingly depend 
on space-based infrastructure. “To do so would be at odds with the 
Antarctic Treaty, but the sparse use of the treaty’s inspection mecha- 
nisms means that such activity could go undetected,” Bateman and 
Bergin say. “If Antarctic sites take on military significance, we could 
see a move towards destabilisation of Antarctica as a zone of peace.” 

That may seem a far-fetched scenario, but events may not have to 
take such a dramatic turn to undermine the treaty and weaken the 
monopoly that science currently has on Antarctica. The late Christo- 
pher Joyner of the Department of Government at Georgetown Uni- 
versity in Washington DC identified three potential challenges to the 
cooperative spirit of the agreement, driven by the changing global 
political and economic climate. States might implement national con- 
tinental-shelf claims in offshore Antarctic waters in pursuit of energy 
resources, he suggested, or tensions could escalate between Japan and 
Australia over whaling in Antarctic waters. Joyner’s third scenario 
— widespread and unregulated bioprospecting — is a topic already 
identified as problematic by treaty members, and 
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plants and animals for beneficial genetic and biochemical resources. 
Although the Antarctic Treaty seems under no immediate threat, 
one need only look at the escalating political importance of the Arctic 
to see the potential for change in the south. Just this week, members of 
the Arctic Council were due to discuss requests from China and India 
for official observer status. And even under the strict conditions of 
the treaty, the appeal of Antarctica to some nations searching for new 
mineral and energy resources has never been too far from the surface. 
Scientists can play an important part in 


“There remains preserving the treaty and protecting their 
gr eat appeal unique Antarctic playground, and the 
in the spirit of promise it continues to offer for research. 
Scott’s expedition Interest in the poles is currently sky-high — 
—ofscientistsas witness the success and international syn- 


explorers.” dication of the BBC’s Frozen Planet series 
— and the more that scientists can promote 
the work that they do there, the more the bond between the two will 
be cemented in the public’s mind. There remains great appeal in the 
spirit of Scott’s expedition, of scientists as explorers, and in that vein, 
sometime this year, researchers in Antarctica are likely to be the first to 
penetrate a sub-glacial lake. The long-standing Russian effort to drill 
into Lake Vostok stalled again this month, raising the prospect that, 
if they fail to break through this month, then a British team seeking 
to investigate Lake Ellsworth could beat them to it in December. Not 
that priority — the loss of which so devastated Scott — matters in 
2012. “It’s not a race a spokeswoman for the British project insists. 
We believe them, just. m 


Face up to fraud 


The UK government and funding agencies 
must address research misconduct. 


any people in science would rather not talk about the problem 
M: research misconduct, much less act on it. After all, who 
directly involved would benefit from a serious crackdown? 
Certainly not the institutions at which the misconduct takes place — 
they are nominally responsible, but can face legal repercussions, embar- 
rassing headlines and a public-relations disaster if they expose cheating 
academics. It is much easier to shuffle miscreants out of the side door 
with vague references and a promise of silence, effectively pushing the 
problem somewhere else, and onto someone else. 
So it is perhaps a sort of progress that the British Medical Journal 
and the international Committee on Publication Ethics were able to 
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organize a meeting on the subject in London last week, gathering rep- 
resentatives from universities, funders, journals and lobby groups to 
discuss how the problem could be tackled in the United Kingdom 
(see Nature http://doi.org/hmx; 2012). The meeting broke little new 
ground, but its organizers do, at least, deserve credit for trying. 

A big part of the problem is the lack of perceived risk associated with 
misconduct. Some fraudulent researchers might be sociopaths who 
don’t care about the rules, but many others simply believe that they 
can anticipate the outcome of a research project, and see no downside 
to fabricating the required results to save time, or tweaking results to 
achieve a stronger signal. Either way, stronger action and punishments 
are needed to discourage such misbehaviour. (Meanwhile, for col- 
leagues considering blowing the whistle, the risks are glaringly huge — 
witness the plight of scientists, such as cardiologist Peter Wilmshurst, 
who have raised questions and have faced the full force of Britain’s 
ludicrous libel laws as a result.) 

Could publications such as this one do more to deter cheats? Unfor- 
tunately, we are often in no position to flag up even proven cases of 
misconduct, and thereby highlight the risks that miscreants run with 
their careers. Yes, it is a journal’s primary job to clean up the literature, 
but when papers are retracted owing to misconduct, the libel laws 
(again) often prevent our editors from saying so. We know that this 
leaves the affected communities frustrated and in the dark. It leaves 
us frustrated, too. 

So, with journals unable to push towards greater integrity and uni- 
versities often unwilling to do so, should funding agencies be leading 
the charge? It is, after all, their money that is wasted if misconduct 
does occur. 

Funding agencies in the United States do sometimes investigate mis- 
conduct. Research funded by the National Institutes of Health and 
some other government agencies falls under the remit of the Office for 


Research Integrity (ORI), which has the power to bar researchers from 
receiving future funding. However, as Nicholas Steneck, director of the 
research-ethics programme at the Michigan Institute for Clinical and 
Health Research in Ann Arbor, told the London meeting, this process 
probably misses most major misconduct. And the ORI can't initiate 
investigations: institutions must conduct their own inquiries first. 

In the United Kingdom, there seems to be 


“Stronger little appetite for launching an overarching 
action and ORI-type regulator. Certainly, the exist- 
punishments ing independent advisory group, the UK 
are needed to Research Integrity Office in Falmer, is clear 
discourage that it has no desire to take on such a role. 


British funding councils — in collaboration 
with the country’s universities — have cho- 
sen instead to produce a ‘concordat’ detailing good practice, to which 
institutions will be expected to sign up. This is laudable, but unlikely 
to strike fear into fraudsters and fabricators. 

So, how can Britain highlight cases of misconduct and discourage 
it in future? Ultimately, the incentives probably need to come from on 
high, and the government could get the ball rolling by commissioning an 
anonymous survey on misconduct that UK researchers have witnessed 
and perpetrated. An official audit would offer a strong platform for oth- 
ers to build on — perhaps with a parliamentary inquiry and subsequent 
report on the damage done to UK science by misconduct, and an assess- 
ment of the options for tackling it and the investment needed. Funders 
and universities could then work together to establish common defini- 
tions of what counts as misconduct, and howit will be punished. And if 
areform of the libel laws goes ahead, journals and other scientists would 
be able to do more to highlight and expose miscreants. 

Sounds ambitious? If the solutions were easy, there wouldn't be a 
problem to discuss. But there is, so we must face it. m 


misbehaviour. ” 


Cap in hand 


A word to the wise on getting that much-needed 
research funding. 


ould you be willing to spend weekends on the yacht of a 
W events billionaire in the name of science? Or insist to air- 

port check-in staff that your life-saving research demands 
that you be upgraded from economy to business class? Perhaps you 
would be happy to see your face on a T-shirt? Or for folks you met on 
the Internet to traipse through your lab, taking photographs? 

Welcome to the cold reality of science in a global recession. As the 
flow of public money slows across the world, academic researchers 
are increasingly turning to private funders and wealthy individuals- 
turned-philanthropists to pay for their work. There is a strong tradition 
of such support already, of course, especially in the biomedical field. 
But as government cuts around the world begin to bite, more and more 
scientists will be looking for alternative sources of income. 

So, ina short series of articles this week, Nature focuses on where 
that money is and how you can access it. Do not feel too proud to ask. 
Some of your competitors are doing it already. 

In our News Features, we look at the two extremes of research paid 
for by private individuals — from the billionaires willing to set up 
entire laboratories and pay for the work done there for years, to the 
web-based begging bowls that can take just a few dollars each from 
thousands of different people. 

On page 254, we talk to those at the top: the big spenders, the entre- 
preneurs — and those scientists who have benefited from their largesse. 
How did they do it? Partly by being in the right place at the right time, 
although it helps to know where the right places are. And it helps even 
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more to have something to say when you get there. As Thomas Pierson 
of the SETI Institute in Mountain View, California, says, “People give 
money to people.’ One secret seems to be to think big. “If I ask for 
$100,000 and they say ‘yes’ right away, then I didn't ask for enough,’ one 
university fund-raiser tells us. “It’s a common mistake” 

At the other end of the scale are the crowd-funding websites, 
explored on page 252, where scientists can post details of their pro- 
jects and ask many individuals to collectively cough up for them. 
From a low-cost robot for tackling oil spills to a project to map water 
quality along the Mississippi River, some researchers are already adept at 
tapping the potential of the masses. And although the target donors 
may be different, one key strategy remains the same: tell a good story. 
Sell yourself and sell your science. 

Still, be wary of selling yourself short, warns a Comment piece on 
page 260. Patrick Aebischer, president of the Ecole Polytechnique 
Fédérale de Lausanne in Switzerland, complains that too many dona- 
tions to university research from charities and foundations come with 
a catch — they don't pay for the associated costs, such as salaries and 
utility bills. “Institutions with many privately funded projects are 
effectively ‘punished for their success,’ he writes. “To meet the higher 
research-infrastructure costs, universities may drain resources from 
education, or diminish ‘expensive’ disciplines such as physics, chem- 
istry or engineering, in which philanthropic support is scarce.” The 
solution, he says, is for institutions to identify the full cost of research 
activities and pass it on. “Private bodies should not hijack university 
resources. They should contribute a fair share of the expense.” 

The money is certainly out there. Just look at the billions poured 
into football teams. And if the pages of the glossy magazines can be 
believed, the luxury-goods market remains 
strong. The money must be spent on something, 
so why not science? And although those week- 
ends on luxury yachts may be a tough way to 
make it happen, someone has to do it. m 
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Mars probe crash 


Russia’s Phobos-Grunt 
spacecraft, which failed to 
make it out of Earth orbit in 
its attempt to reach a moon of 
Mars last year, splashed down 
in the southern Pacific Ocean 
on 15 January. As Nature went 
to press, exact details of the 
probe's crash and location 
were unclear, but it is expected 
to have scattered fragments 
over thousands of square 
kilometres. On-board tanks 
containing unused, toxic fuel 
probably broke up well before 
the probe re-entered Earth's 
stratosphere. See go.nature. 
com/qdv3to for more. 


POLICY 


NOAA to move? 

US President Barack Obama 
has proposed moving the 
National Oceanic and 
Atmospheric Administration 
(NOAA) from the 
Department of Commerce to 
the Department of the Interior, 
as part of a broader agency 
reorganization that is intended 
to save taxpayers about 

US$3 billion over 10 years. 
Obama made the proposal 

on 13 January; what the move 
would mean for NOAA is 
unclear. See go.nature.com/ 
efbprv for more. 


Stem-cell lawsuit 
Two scientists seeking to 

block US government funding 
for research using human 
embryonic stem (ES) cells filed 
their arguments to a US appeals 
court on 12 January. James 
Sherley and Theresa Deisher, 
who work on adult stem cells, 
are appealing a July 2011 
decision in which a federal 
judge ruled against their case. 
The appeal is set to be heard 

on 23 April. On 12 January, 

the US National Institutes of 
Health made four more human 
ES-cell lines eligible for use 


Iranian chemical engineer assassinated 


A chemical engineer who worked at Iran’s 
Natanz uranium-enrichment facility was killed 
on 11 January in Tehran — the latest victim ofa 
string of assassinations and attacks apparently 
aimed at Iran’s nuclear programme. Mostafa 


in federally funded research, 
bringing the approved total to 
146. See go.nature.com/wtsnxn 
for more. 


US reactor safety 
The United States may follow 
France in recommending 
that nuclear power plants 
install back-up equipment for 
containing a serious accident, 
in the wake of the meltdowns 
at Japan’s Fukushima plant. 
On 11 January, the Nuclear 
Regulatory Commission 
(NRC) said that it considered 
a plan put forward by the 
Nuclear Energy Institute, a 
body based in Washington DC 
that represents the nuclear 
industry, to be “a reasonable 
starting point”. The plan 
involves placing portable 
pumps, generators and other 
equipment around reactors 
for use in emergencies. The 
NRC is expected to announce 
new safety measures before the 
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11 March anniversary 

of the Fukushima accident. 
See go.nature.com/saeouu 
for more. 


Emissions data 

The US Environmental 
Protection Agency has 
released a public database of 
greenhouse-gas emissions 
from the nation’s largest power 
plants and industrial facilities. 
The agency requires more than 
6,200 facilities to report annual 
emissions, and their 2010 
emissions cover around half of 
the nation’s direct emissions. 
See page 247 for more. 


San Raffaele saved 
The debt-ridden San 

Raffaele Scientific Institute 

in Milan, one of Italy’s most 
prestigious biomedical 
research institutes, will not 

be taken over by the Vatican 


Ahmadi Roshan Behdast, 32, was assassinated 
by a magnetic bomb that had been attached 

to his car (pictured). He was identified in the 
Iranian press as a deputy director of marketing 
at the Natanz facility. See page 249 for more. 


after all. The institute filed 

for bankruptcy last October, 
after accumulating debts of 
€1.5 billion (US$1.9 billion), 
and a consortium led by the 
Vatican Bank put in a bid 

to rescue it (see Nature 478, 
296-297; 2011). But Milanese 
businessman Giuseppe Rotelli, 
head of the private-health 
group San Donato, which 
runs 18 hospitals in northern 
Italy, unexpectedly outbid the 
consortium on 10 January. 
Scientists had been concerned 
that Vatican influence could 
have distorted research 
agendas. 


Low odds at the NIH 


The success rate for scientists 
seeking grants from the US 
National Institutes of Health 
in Bethesda, Maryland, fell to 
a historic low of 18% in 2011, 
the agency announced on 

13 January. The decline was 
driven by an 8% increase since 
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: 2010 in applications submitted 
> for competitive awards, 

to almost 50,000. At 15%, 
women’s success rates were 
equal to those of men for new 
grant applications, but when 
competing for the renewal of 
existing grants, women's 33% 
success rate trailed men’s 36%. 


IV. 
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Clean-energy spend 
Global government spending 
on research and development 
in clean energy fell by 22% 
from 2010 to US$13.2 billion 
last year, according to figures 
released by analysts Bloomberg 
New Energy Finance on 12 
January. Corporate research 
spending also slipped back, 
falling by 14% from 2010 

to $13.2 billion. But total 
investment in the sector — 
which includes the financing 
of energy projects such as large 
solar installations and rooftop 
photovoltaics — surged by 5% 
from 2010 to reach $260 billion 
worldwide. 


Frogs in size war 
Barely a month after a frog 
took the title of world’s tiniest 
tetrapod, an even smaller frog 
has emerged to steal the crown. 
With an average length of just 
7.7 millimetres from snout to 
vent, Paedophryne amauensis 
(pictured) is 1 mm shorter 
than the previous record- 
holder, P. dekot. Both frogs live 
in leaf litter in the forests of 


TREND WATCH 


The US Office of Research 


SOURCE: ORI 


Integrity (ORI) is sharpening its 


eye for plagiarism. In the past, the 


office has censured researchers 
mostly for fabricating (making 
up) and falsifying (manipulating 
or omitting) data, but two of its 
three latest findings involved 
plagiarism. John Dahlberg, 
head of investigative oversight 
at the ORI, says that such 

cases are surfacing because of 
increased use of plagiarism- 
detection software, so the office 
is toughening its stance. See 
go.nature.com/qvp8qg for more. 


Papua New Guinea and make 
up two of the six minuscule 
Paedophryne species so far 
identified on the island. The 
latest find was reported on 

11 January (E. N. Rittmeyer et 
al. PLoS ONE http://doi.org/ 
hmw; 2012). 


Transgenic flight 
The German chemical 

giant BASF is shifting its 
transgenic-plant operations 
from Europe to the Americas, 
it says, because of widespread 
opposition to the technology 
from consumers, farmers 
and politicians at home. On 
16 January, the company said 
that it would move its plant- 
science headquarters from 
Limburgerhof, Germany, 

to Raleigh, North Carolina, 
cutting 140 jobs in Europe, 
although it would retain 
research centres in Berlin and 
Ghent, Belgium. Jonathan 
Jones, a plant researcher at 
the Sainsbury Laboratory in 
Norwich, UK, said the move 


was “a sign that Europe is not 
open for business in this area”. 
See go.nature.com/nramri for 
more. 


Nuclear security 
Australia is the most secure 
of the 32 countries that 

hold at least one kilogram 

of weapons-useable nuclear 
material, according toa 
first-of-its-kind study that 
asked experts to assess risks 
of nuclear theft. North Korea 
is listed last, and the United 
Kingdom ranks highest among 
nuclear-armed states, in tenth 
place. The 11 January study 
(www.ntiindex.org) from 

the Nuclear Threat Initiative, 
anon-profit organization 

in Washington DC, also 
examined security conditions 
in 144 non-nuclear states that 
might be used as safe havens 
or transit points for stolen 
material. 


Astrophysicist dies 


Steven Rawlings, an 
astrophysicist at the University 
of Oxford, UK, who helped to 
instigate and coordinate the 
proposed Square Kilometre 
Array (SKA) telescope 

project, died in mysterious 
circumstances on 11 January, 
aged 50. His body was found at 
the house ofa friend and police 
are now investigating the death. 
Shocked scientists added that 
it was a great loss to the SKA 


US AUTHORITIES CRACK DOWN ON PLAGIARISM 


Plagiarism is increasingly being cited in research misconduct 
findings from the US Office of Research Integrity. 
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COMING UP 


22-27 JAN 

The sixth ‘Arctic 
Frontiers’ conference, 
in Troms, Norway, 
discusses the potential 
of energy resources in 
the Arctic. 
www.arcticfrontiers.com 


23 JAN-17 FEB 

The World 
Radiocommunication 
Conference of 

the International 
Telecommunication 
Union in Geneva, 
Switzerland, will vote 
on whether to abolish 
the leap second, pulling 
our reference time out 
of sync with the Sun (see 
Nature 479, 158; 2011). 
go.nature.com/qqlzi7 


project. Telescope construction 
would not start until 2016, buta 
pivotal decision about whether 
to locate it in South Africa or 
Australia is expected by March. 
See go.nature.com/c8bhhn 

for more. 


Red-wine fraud 


A three-year investigation 
into a biology laboratory at 
the University of Connecticut 
has found its chief guilty of 
falsifying and fabricating data 
in at least 23 papers and 3 grant 
applications. Dipak Das, 
director of the Cardiovascular 
Research Center at the 
University of Connecticut 
Health Center (UCHC) in 
Farmington, is a cardiologist 
whose studies included work 
on the red-wine chemical 
resveratrol. On 11 January, the 
UCHC issued a 60,000-page 
report detailing extensive 
misconduct; on the same day, 
Das was dismissed as co-editor- 
in-chief of Antioxidants & 
Redox Signaling, and two of 
his articles in the journal were 
retracted. See go.nature.com/ 
faxbyj for more. 
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THE GAS TRACKER 


IN FOCUS | NEWS | 


The US Environmental Protection Agency’s online greenhouse-gas database covers about half of the country’s 2010 direct emissions. About 
three-quarters of those are produced by power plants; around 45% come from 155 facilities (just 2.5% of those that submitted data), each of which 
emits at least 5 million tonnes of carbon dioxide equivalent (MtCO,e) a year. 
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Greenhouse-gas emissions in the United States have waned slightly owing 
to a downturn in the economy, as well as a move from coal to natural gas 
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Database tallies US emissions 


Environment agency launches searchable public log of major greenhouse-gas emitters. 


BY JEFF TOLLEFSON 


eeping track of a country’s greenhouse 

gases is an accounting problem of epic 

proportions. In the United States, 
scientists have relied on a mix of methods to 
build up their national emissions inventory, 
including monitoring the electricity output of 
a power plant and assessing the quality of the 
fuel that powers it. 

Now they have a new resource: official 
data from the companies themselves, col- 
lated into a user-friendly online database 
that was launched on 11 January by the US 


Environmental Protection Agency (EPA). 

“It’s a great resource, and I’m sure people 
will find interesting things to do with it,” says 
Gregg Marland, a geologist at Appalachian 
State University in Boone, North Carolina, 
who led the development of guidelines for the 
Intergovernmental Panel on Climate Change 
on how to construct and use national emis- 
sions inventories. 

The inventory covers industrial, com- 
mercial and government facilities that emit 
more than 25,000 tonnes of carbon diox- 
ide equivalent per year, and was prompted 
by a law passed in 2008 that was meant to 


support the growing efforts towards regulating 
greenhouse gases. Plans for that regulatory 
programme collapsed in 2010, but EPA offi- 
cials continued to work on a database, which 
would still provide investors and consumers 
with statistics to help them pressure industry 
to cut emissions. 

The inventory covers facilities that directly 
emit about half of the country’s total emissions, 
but does not include emissions from agricul- 
ture or land use. The online database allows 
users to compare and rank about 6,200 facili- 
ties by state, sector, and type and volume of 
greenhouse gases, including the ubiquitous > 
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> carbon dioxide and methane and the 
more esoteric fluorinated chemicals. 

The data should help researchers trying 
to measure and track greenhouse-gas emis- 
sions, as well as those studying natural car- 
bon cycles. As scientists pin down exactly 
where anthropogenic carbon emissions 
are coming from, Marland says, they can 
separate out carbon uptake and emissions 
by plants and soils. “The better we under- 
stand both the magnitude and distribution 
of human emissions, the better we under- 
stand what is happening in the biosphere” 


THE USUAL CULPRITS 

Some database searches produce very 
familiar results. Carbon dioxide makes up 
about 95% of the greenhouse-gas emis- 
sions logged (although if emissions from 
agriculture and other sectors in the full US 
inventory are included, that proportion 
would drop to around 83%). The state with 
the largest overall emissions is Texas, which 
logged 387 million tonnes of carbon diox- 
ide equivalent in 2010. And power plants 
overshadow any other stationary sources 
of greenhouse gases, accounting for about 
three-quarters of emissions in the inventory 
(see “The gas tracker’). 

But there are also some eye-opening 
statistics. Just 2.5% of the facilities that have 
submitted data to the EPA are responsible 
for 45% of the emissions, for example. And 
almost all of the United States’ emissions of 
trifluoromethane (also known as HFC-23), 
a potent greenhouse gas, come from just 
two facilities — the Honeywell Interna- 
tional plant in Baton Rouge, Louisiana, 
and the Dupont manufacturing plant in 
Louisville, Kentucky. Since 1990, US pro- 
ducers have voluntarily reduced by about 
85% their emissions of trifluoromethane, 
which is a by-product of the manufacture 
of the refrigerant and chemical feedstock 
chlorodifluoromethane. 

The flood of data does not mean that 
scientists can stop measuring greenhouse- 
gas emissions in the atmosphere. Pieter 
Tans and his team at the National Oceanic 
and Atmospheric Administration’s Earth 
System Research Laboratory in Boulder, 
Colorado, measure greenhouse-gas plumes 
from major facilities through a network of 
tall monitoring towers, for example. The 
annual totals in the database will certainly 
help to improve their atmospheric models, 
he says, but monitoring how emissions 
vary over the course of hours and days is 
still vital. 

Totalling those real-time measures 
should also provide a way to verify the com- 
panies’ annual estimates. “The hypothesis is 
that these are correct emissions estimates,” 
Tans says. “We can test that to see whether 
what is being reported is consistent with 
actual observations.” = 
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EDUCATION 


Evolution advocate 
turns to climate 


Education centre known for battling creationists aims to help 
science teachers convey understanding of global warming. 


BY SUSAN YOUNG 


global warming is mentioned; teachers 

pressured to change lesson plans to avoid 
the subject or portray it as speculative rather 
than a matter of scientific consensus. For 
Eugenie Scott, the stories and anecdotes fit 
a familiar pattern. Scott is executive direc- 
tor of the National Center for Science Edu- 
cation (NCSE) an organization based in 
Oakland, California, with a reputation for 
doggedly defending the teaching of evolution 
in US classrooms. But a growing impression 
that climate science is facing a similar strug- 
gle, together with entreaties from educators 
and textbook authors, has helped to con- 
vince her that the NCSE should expand its 
mandate to include the politically charged 
issue of global warming. 

“I think we can make an important contribu- 
tion,’ says Scott. “If teachers understand that 
there is a place that they can go to for help, we 
can use some of the expertise that we've gained 
over the years dealing with evolution to apply 
to this related problem” 

A recent survey in the United States 
suggests that there is indeed a problem. From 
August to October 2011, the nation’s National 
Earth Science Teachers Association (NESTA) 
queried Earth- and space-science teachers 
about their experiences of climate-change 
education. Depending on the region, 25-30% 
of respondents reported that students, parents, 
administrators or other community members 
had argued with them that climate change is 
not happening or that it is not the result of 
human activity. Some school boards and state 
legislators have threatened to require educators 
to ‘teach the controversy about climate change 
— aterm coined in relation to evolution that 
amounts to presenting a scientific theory as 
one of various possible viewpoints. 

Many of the teachers surveyed have strong 
science backgrounds and professional 
experience related to climate change. But “when 
you look at what the teachers are facing across 
the country, it goes way beyond science’, says 
Roberta Johnson, executive director of NESTA, 
based in Boulder, Colorado. “It goes into areas 
of political debate.” 

Ina 16 January announcement, the NCSE 


. tudents walking out of classrooms when 


201.2 


© 2012 Macmillan Publishers Limited. All rights reserved 


says that it will offer support to educators 
facing ideological opposition when teach- 
ing climate change, providing advice on how 
to present the underlying science. The strat- 
egy mirrors its approach to evolution, which 
includes clarifying for students why science is 
an appropriate tool for understanding the nat- 
ural world. “This perspective is also important 
in helping people to understand the reasons 
why scientists overwhelmingly accept climate 
change,’ the NCSE says in a mission statement 
describing the new effort. 

The statement also says that the NCSE will 
not take a position on what, ifanything, should 
be done to counteract global warming or mit- 
igate its effects. “What to do about it ranges 
widely and gets outside of the strict science and 
into policy issues in which many, many vari- 
ables are going to have to be considered,’ says 
Scott. “We are not a policy think tank; we don’t 
have expertise in this area.” 

But such policy neutrality may not prevent 
science teachers from being challenged in 
the classroom. “The core issue is not whether 

global warming 


“Whenyoulook is happening, or 
at what the whether humans 
teachers are are involved, but 
facing across whether it is a crisis,” 
the country, it says James Taylor, a 
goes way b eyon d research fellow at the 


Heartland Institute, a 
libertarian think tank 
based in Chicago, 
Illinois, that opposes the regulation of carbon 
emissions (see Nature 475, 440-441; 2011). 

Scott acknowledges that there is more to 
teaching climate change than explaining the 
science clearly. “We need to be aware of the 
fact that people are very emotionally con- 
cerned about these issues,” she says. If people 
feel threatened ideologically, politically or eco- 
nomically, “all the science in the world won't 
convince them”. She adds that the NCSE will 
also help teachers to understand the views of 
parents and others who oppose the teaching of 
climate change. 

“Knowing the motivations behind a 
parent's views can help a teacher come up with 
a solution or response that might assuage that 
parent's concerns and let their kid remain in the 
classroom,’ says Scott. m 


science.” 
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NUCLEAR FALLOUT 


IN FOCUS | NEWS | 


Several scientists have been targeted for assassination in lran over the past two years, although some have no verifiable connection to the country’s nuclear programme. 


Masoud Alimohammadi 
Particle physicist at the 
University of Tehran, no link 
to nuclear programme. 
Killed 12 January 2010 


NUCLEAR POLITICS 


Fereydoun Abbasi-Davani 
Senior nuclear scientist at 
defence ministry, now head 
of nuclear programme. 
Injured 29 November 2010 


Majid Shahriari 

Nuclear physicist, possibly 
involved in nuclear 
programme. 

Killed 29 November 2010 


Dariush Rezaei-Nejad 
Electrical-engineering 
student, no link to nuclear 
programme. 

Killed 23 July 2011 


Mostafa Ahmadi Roshan 
Behdast 

Deputy director of marketing 
at Natanz uranium facility. 
Killed 11 January 2012 


urders unlikely to slow 
Iran’s nuclear efforts 


Experts say international sanctions are the best way to stall the weapons programme. 


BY SHARON WEINBERGER IN BEIRUT 


ast week's assassination of an official 
Lee at Iran’s uranium-enrichment 

facility was worthy ofa James Bond film: 
the 32-year-old was killed by a magnetic bomb 
placed on his car by a passing motorcyclist. 

But the murder of Mostafa Ahmadi Roshan 
Behdast is unlikely to result in a neat filmic 
denouement. His death is the latest in a string 
of assassinations and other attacks seemingly 
aimed at Iran’s nuclear programme over the 
past few years (see ‘Nuclear fallout’). 

Although experts agree that at least some 
of the killings are part of an organized foreign 
campaign to slow Iran's efforts to enrich ura- 
nium, they are sceptical that the strategy will 
work. “The immediate effect is very small,” 
says Olli Heinonen, a senior fellow at Harvard 
University’s Belfer Center for Science and 
International Affairs in Cambridge, Massachu- 
setts. “If I have a project that is important for 
national security, I never count on one single 
person,’ he says, suggesting that international 
sanctions are a more effective way of slowing 
Iran’s efforts. 

Heinonen, former deputy director-general 
of the International Atomic Energy Agency in 
Vienna, points out that there is still no proof 
of who ordered the assassination, or why. 
Ahmadi Roshan was named in many Western 
press reports as a nuclear scientist (his degree 
was actually in chemical engineering), but the 


Iranian press identified him as deputy director 
of marketing at the uranium-enrichment facility 
in Natanz, suggesting that he was nota crucial 
figure in Iran’s nuclear programme. 

Peter Zimmerman, a physicist and emeritus 
professor at King’s College London, says that 
killing key personnel could delay a nuclear 
programme in the right circumstances. For 
example, killing Leslie Groves, the military 
chief of the Manhattan Project during the 
Second World War, might have had serious 
consequences if it had happened in 1942 or 
1943 when the atomic-bomb project was 
in doubt and Groves’ advocacy was crucial. 
But “if I had killed [J. Robert] Oppenheimer 
in 1944, I don’t think it would have delayed 
Hiroshima by even a month, and maybe not 
a week’, suggests Zimmerman. In Iran, he 
says, none of the victims so far is crucial to the 
nuclear programme. 

Even identifying key personnel in the 
programme can be difficult, because scien- 
tists suspected of involvement often have 
academic affiliations. In an interview in 
2007 with this reporter, Fereydoun Abbasi- 
Davani introduced himself as a university 
professor and head of the Nuclear Society of 
Iran, an academic association for promot- 
ing nuclear knowledge. But he was also ona 
United Nations sanctions list for his alleged 
involvement in Iran’s clandestine military 
programme. In 2010, an attempt to assassinate 
Abbasi-Davani failed, and he was subsequently 


appointed as the head of Iran’s Atomic Energy 
Organization in Tehran. 

At the time, Abbasi-Davani described efforts 
to slow Iran’s nuclear programme as “fruitless”, 
and resulting in only short-term setbacks. For 
example, he said, the United States allegedly 
led a ban on selling software to Iran that could 
be used for calculating nuclear reactions. 
Abbasi-Davani said that Iranian scientists 
eventually developed their own software, in 
part using information from contacts abroad. 
“We were slowed down, but we could get the 
knowledge and know-how that we needed in 
other ways,” he said. 

A similar recovery was seen after the Stuxnet 
computer worm wreaked havoc with the 
uranium-enrichment centrifuges at Natanz in 
2010, says Scott Kemp, an associate research 
scholar at Princeton University’s Woodrow 
Wilson School for Public and International 
Affairs in New Jersey. “They overcompen- 
sated, they began enriching even faster, and 
within six weeks, they were caught up,’ he 
says. Soon, he adds, the facility had moved 
even closer to being able to produce enough 
highly enriched uranium for a nuclear 
weapon. 

Kemp suggests that killing scientists who 
are allegedly associated with the programme 
could have the same effect, by bolstering Iran’s 
resolve to develop its nuclear capability. “These 
things have unintended and unforeseeable 
consequences,” he notes. = 
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PHARMACEUTICALS 


Russian drug law hinders 
clinical trials 


Legislation to increase availability of new medicines has delayed approvals. 


BY ALLA KATSNELSON 


law aimed at improving Russians’ 
Ax to innovative medicines has 
backfired badly. 

“On the circulation of medicines’, a law 
that came into effect in September 2010, was 
intended to streamline the bureaucratic system 
for testing and registering new drugs in Russia. 
There were high hopes that it would also boost 
the nascent domestic pharmaceutical indus- 
try and attract foreign companies to run trials 
there. The law set a maximum cost and time 
limit for drug registrations, for example; man- 
dated compensation for those injured during 
trials; and declared that drugs manufactured 
in Russia solely for export did not need to 
undergo registration. 

But the law has actually hampered clinical 
trials and limited the number of drug approv- 
als over the past year, according to a report 
from the Association of Clinical Trials Organi- 
zations (ACTO) in Moscow, which represents 
companies conducting clinical trials in Russia. 

One problem is that medicines must now 
undergo local trials before they can be sold in 
Russia, regardless of whether they have already 
undergone trials or been approved elsewhere. 
The goal is to ensure that drugs are backed by 
high-quality trials that prove their effective- 
ness in the indigenous population. A few Asian 
countries have similar rules, but whereas some 
Asians are thought to metabolize certain drugs 
in a different way from Europeans, that is not 
the case for Russians, says Leonid Kokovin, a 
clinical-trials manager based in Moscow. “We 
are statistically like European and American 
subjects, so the argument for testing on our 
population isn’t understandable,” he says. 

The law also raises ethical concerns because 
it subjects people to clinical trials that are not 
expected to yield extra knowledge, Kokovin 
notes. He suggests that medicines backed by 
suitable foreign trials should be approved by the 
Russian authorities, as long as they are tracked 
in post-marketing studies. “That would be 
much simpler, and wouldnt put such a barrier 
on the market,” he says. 

ACTO’s report describes one (anonymized) 
European company’ struggle to register a drug 
for preventing preterm delivery, a condition 
that affects about 5% of pregnancies. The 
medicine is already approved for this use in 


many Western countries, and for other uses 
in Russia. The cost of running a Russian trial 
was estimated to be €1 million (US$1.3 mil- 
lion), and patients in the control group would 
have to be asked to avoid the drug, raising their 
risk of preterm delivery. Furthermore, clinical 
trials on pregnant women are rare in Russia, 
so there are few accredited facilities that can 
screen participants. 

Despite the difficulties, the company is going 
ahead with the trial. However, many others 
have been discouraged bya lack of detail about 
how to comply with the law. “No one can say 
— including, probably the regulatory agency 
— what kinds of trial specifically should be 


AIITEKA 


Russian pharmacies can’t sell some internationally 
approved drugs, owing to a strict law. 


run, says Dmitry Margolin, co-chairman of 
ACTO’s regulatory committee. “No one can 
give the guarantee that if you do these trials 
then the data will allow you to register the 
medicine.” Margolin says he is aware of “sev- 
eral tens” of instances in which manufacturers 
had planned to seek registration fora medicine, 
but had been deterred by the requirement for 
fresh trials. 

According to information from ACTO and 
the Ministry of Health and Social Develop- 
ment, which oversees the registration of 
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medicines, just three repeat trials are under way 
for innovative medicines already approved out- 
side Russia — two for hepatitis C and one for 
neuralgia — with two more set to start soon. “If 
the situation continues,’ says Margolin, “thena 
large number of new products will not be regis- 
tered and will not reach Russian pharmacies.” 

On the basis of meeting summaries posted 
on the ministry website, ACTO also estimates 
that the ministry’s Ethics Council rejects more 
than 30% of applications to run trials, mostly 
as a result of administrative or clerical errors 
in the applications. To avoid undue influence, 
Russia's new medicines law forbids the Ethics 
Council from directly contacting companies, so 
it cannot request that small errors are corrected. 
By contrast, in Germany, where the rejection 
rate is 1%, approvals are routinely conditional 
on small changes in the application. “It’s one of 
the main flaws of the Russian regulatory sys- 
tem under the new law,’ says Sergei Lomakin, 
a senior associate at the Moscow office of the 
international law firm Baker & McKenzie. 
“The system leads to a bigger number of rejec- 
tions than one that allows scientific dialogue” 
between companies and regulators, he says. 

Another provision of the law — that medi- 
cines developed abroad cannot have their first 
clinical tests in Russia — is hampering foreign 
companies that are trying to develop products 
specifically for the Russian market. Ironically, 
many such companies have agreements with 
government-backed investors, such as the 
nanotechnology venture fund Rusnano, to 
develop new medicines in the country. 

“These changes in the guidelines happened 
at the moment when the country had invested 
so much in innovative drug development 
that there were quite a few drugs which were 
around the corner from starting phase I trials,” 
says Andrei Gudkoy, chief scientific officer 
of Cleveland BioLabs in Buffalo, New York, 
which partners with Rusnano and is strug- 
gling to launch a Russian early-stage trial for 
one of its anticancer compounds. The Ministry 
of Health and Social Development did not to 
respond to Nature’s questions on the matter. 

ACTO and others say that scrapping the 
requirement for local registration trials would 
be an important first step towards fixing the 
situation. Until that can be done, says Margo- 
lin, “companies for the most part are sitting 
and waiting” = 
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Gemini North fires a laser at the sky (shown in time-lapse) to correct for atmospheric distortions. 


ASTRONOMY 


Geminr’s twin 
telescopes reboot 


Adaptive optics put the observatory at the cutting edge. 


BY ERIC HAND 


he Gemini Observatory’s star has 

been slow to rise. The publicly funded 

observatory boasts two of the world’s 
largest telescopes, and offers a rare opportu- 
nity to watch both the northern and southern 
skies from sites in Hawaii and in Chile. Yet it 
has lagged in terms of scientific productivity 
since it opened in 2000, and millions of dol- 
lars have been spent on overambitious, highly 
specialized instruments that were ultimately 
cancelled. “It felt like a child who had gone 
astray,’ says Frederic Chaffee, who witnessed 
Geminis struggles while director of the rival 
Keck Observatory on Hawaii’s Mauna Kea. “It 
was frustrating to watch.” 

But less than a year after Chaffee was drafted 
out of retirement to become Gemini’s interim 
director when his predecessor stepped down, 
the once faltering facility may finally be living 
up to its promise. Last week, at a meeting of 
the American Astronomical Society in Austin, 
Texas, Chaffee showed off results from several 
new instruments. These should benefit a range 
of observers at the twin 8.1-metre telescopes. 
“In the last year, Gemini 
has made huge pro- 
gress in turning around 
the observatory,” says 
David Silva, director of 
the US National Optical 


For a slideshow of 
amazing Gemini 
images, see: 


Astronomy Observatory in Tucson, Arizona. 

The turnaround is good news for astrono- 
mers of Gemini’s seven member nations: the 
United States, the United Kingdom, Canada, 
Chile, Australia, Brazil and Argentina. Many 
are already planning to exploit the new Gem- 
ini multi-conjugate adaptive optics system 
(GeMS), an instrument that saw first light at 
Gemini South in December. 

Adaptive optics are used at large ground- 
based telescopes to correct for image 


PAPER CHASE 


On a per-telescope basis, the scientific 
productivity of the Gemini Observatory 
has lagged behind that of other large 
observatories of equivalent size. 
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distortions caused by the motion of Earth's 
atmosphere. They monitor a bright guide star 
or backscattering from a laser beam fired by 
the observatory to measure the disturbances, 
then rapidly alter the shape of a mirror to 
correct them. But they sharpen only a small 
portion of the telescope’ field of view. Astron- 
omers have known for more than a decade how 
to make corrections over a larger area — by 
performing three-dimensional tomography of 
the sky with multiple lasers (R. Ragazzoniet al. 
Nature 403, 54-56; 2000). But achieving this 
feat has been another matter. 

With its 50-watt laser split into five beams, 
Gemini has managed it first. The area over 
which it has achieved sharpness is more than 10 
times that of other systems. “This is a big step,” 
says Olivier Guyon, an adaptive optics expert at 
Japan's 8.2-metre Subaru Telescope in Hawaii. 
“Tt’s going to be followed by everyone.” Guyon 
adds that the system will help astronomers to 
study the skies more efficiently, by bringing 
large objects, such as nearby galaxies, into focus 
all at once, and by allowing more distant objects 
to be studied in a single snapshot. 

Gemini South also unveiled a long-awaited 
spectrograph in December. This should allow 
users to calculate distances to some of the Uni- 
verse’s most remote objects, such as quasars, 
the luminous cores of distant galaxies contain- 
ing giant black holes. Yet another instrument, 
an imager that could spot planets orbiting dis- 
tant stars, is due by the end of the year. Geminis 
notoriously clunky software interface for sched- 
uling observations has also been overhauled. 

Finding ways to connect with — or at least 
not alienate — Gemini’s far-flung users is 
important, says Chaffee, because astronomers 
rarely fly out to the telescopes to conduct their 
observations. He is establishing a users commit- 
tee that he hopes will represent observers’ con- 
cerns, anda science and technology committee, 
which met last November, that will aim to help 
Gemini avoid repeating the costly and impracti- 
cal instrument choices made in the past. 

But Chaffee believes that instruments such as 
GeMS are already starting to remedy years of 
poor scientific performance. Virginia Trimble, 
an astronomer at the University of California, 
Irvine, says that, on a per-telescope basis, both 
publication and citation counts for Gemini 
have fallen behind those of its rivals (see ‘Paper 
chase’), and the demand for telescope time 
has fallen. However, Gemini deputy director 
Nancy Levenson says that some of this decline 
in demand reflects waning interest by astrono- 
mers in the United Kingdom, which will end 
its membership of Gemini at the end of 2013. 

Gemini’s international agreement will be 
revisited in 2015. By then, Chaffee will have 
long since returned to retirement. His one-year 
contract is up in May, and the Association of 
Universities for Research in Astronomy, which 
manages Gemini, is due to name a new director 
within weeks. “T’'m trying to leave the place in as 
good a shape as I possibly can,” says Chaffee. m 
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Like tt? Pay for tt 


With conventional sources of money drying up, some 
scientists are turning to crowd-funding. 


n October 2010, Cesar Harada found 

himself in New Orleans with little 

money and a big idea. Harada, an engi- 

neer, had been working on oil-spill 
mitigation at the Massachusetts Institute of 
Technology in Cambridge. But he quit the lab 
in frustration at what he saw as a slow pace of 
work and a focus on expensive solutions. He 
travelled south to join the clean-up operation 
for the Deepwater Horizon oil spill in the Gulf 
of Mexico. Once there, his mind turned to a 
futuristic solution: a low-cost clean-up robot 
that local people could build and deploy them- 
selves. Yet his two criteria for the project —a 
quick build and open-source intellectual prop- 
erty — all but ruled out academic or industrial 
funding. 

Harada turned to Kickstarter, a website used 
by authors, film-makers and artists in search of 
project funding. He uploaded a pitch, set a goal 
of raising US$27,500 and listed a series of small 
rewards for donors. Then he started to net- 
work furiously. Money came in from friends 
and engineering colleagues. A few companies 


BY JIM GILES 


heard about his idea; they pitched in several 
thousand dollars each. Word reached people 
he had never met, and they contributed too. 
When Harada’ funding appeal closed in April 
2011, he had raised almost $34,000 — enough 
to assemble a team of engineers and build a 
prototype of the clean-up robot. 


PUBLIC INTEREST 

If Harada’s experience sounds like a one- 
off, think again. Crowd-funding — raising 
money for research directly from the public 
— looks set to become increasingly common. 
Established platforms such as Kickstarter are 
wooing scientists. And similar websites ded- 
icated to connecting scientists with poten- 
tial funders are being built, or have already 
launched. The public seems to be responding. 
Last year, for example, a group of scientists 
wanting to map water quality along the Mis- 
sissippi River raised $64,000 in a trial project 
on an online crowd-funding platform called 
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the Open Source Science Project (OSSP). 

At a time when universities and research 
funding agencies are facing budget cuts, the 
strategy is attracting attention — as are other 
ways to raise philanthropic support (see page 
254). “It’s timely because of what’s happening 
with traditional funding sources,’ says Daniel 
Gutierrez, co-founder of FundaGeek, a crowd- 
funding platform for technology projects that 
launched last month and is based in Yucca 
Valley, California. 

For crowd-funding to make a real difference, 
advocates will have to prove that the process — 
which sometimes sidesteps conventional peer 
review — channels money to good projects, not 
just marketable ones. But if they succeed, there 
may be an unexpected bonus: it might help to 
forge a direct connection between researchers 
and lay people, boosting public engagement 
with science. “This is one of the most appeal- 
ing aspects of crowd-funding,” says Jennifer 
Calkins, an ecologist at Evergreen State Col- 
lege in Olympia, Washington, who has raised 
money for fieldwork on Kickstarter. “We can 
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involve society in the creative journey that we 
make as scientists.” 

Online crowd-funding has already proved its 
worth in other fields. Kiva, a website through 
which individuals loan small amounts to entre- 
preneurs in the developing world, is one nota- 
ble success: more than 600,000 lenders have 
channelled almost $275 million through the site 
since 2005. US President Barack Obama's 2008 
election campaign raised a record-breaking 
$780 million, much of it from small online 
donations. And donors have pledged more 
than $100 million to 13,000 Kickstarter pro- 
jects. By drastically simplifying the process of 
connecting donor with cause, the Internet has 
unleashed a new enthusiasm for giving. 

Scientists have come a little late to the 
crowd-funding party, because they have con- 
ventionally had other funding streams. Jai 
Ranganathan, an ecologist at the University 
of California, Santa Barbara (UCSB), is one of 
several researchers trying to make up for lost 
time. Last November, he helped to launch the 
#SciFund Challenge, an exercise in which close 
to 50 research groups had six weeks to raise 
money through proposals on a crowd-funding 
platform called RocketHub, which mostly 
serves artists and entrepreneurs. The challenge 
raised a total of $76,000. Brian Meece, Rocket- 
Hubs chief executive, based in New York, says 
that research projects are a “new and exciting” 
use for his platform, and that he will retain the 
science section now that the challenge is over. 


CASH FOR QUESTIONS 

Other crowd-funding enthusiasts are devel- 
oping donor sites dedicated exclusively to 
research projects. Sixteen projects are currently 
vying for funds on SciFlies, a site launched last 
November by David Fries, a marine engineer 
at the University of South Florida in St Peters- 
burg. This year, the OSSP hopes to follow up 
on its success with the Mississippi study by 
launching a fund-raising appeal for around ten 
research proposals, says Priyan Weerappuli, a 
neuroscientist at the University of Michigan 
in Ann Arbor and the founder of the project. 

Each site operates in a slightly different way, 
but there are common themes. Researchers 
start by describing and pricing a project, which 
they submit to the site for approval. If accepted, 
the pitch is placed online and donors have a 
few weeks or months to read the proposal 
and make a donation. Some sites operate on 
a non-profit basis and channel all proceeds to 
researchers; others are commercial concerns 
and take a cut of the money raised. 

But although cash-starved scientists are 
lining up to list their projects, some are also 
expressing concerns. Take the issue of peer 

review. SciFlies and 
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projects have “a creative purpose” — as defined 
by the site’s owners. Projects in the #SciFund 
Challenge did not undergo formal peer review: 
Ranganathan and co-founder Jarrett Byrnes, a 
fellow UCSB ecologist, checked only for obvi- 
ous fraud. “I don't care if people have badly 
thought-out projects,” says Ranganathan. 
That may sound like a recipe for shoddy 
science, but crowd-funding advocates say that 
the process has an inbuilt peer-review system, 
driven by the donors. Most donors will hear of 
a project through their social networks. They 
might be former colleagues of the project owner, 
or members of the public interested in an eco- 
logical study site. So project owners put their 
reputation among their peers and supporters on 
the line every time they posta proposal. 
“There’s a strong incentive to be honest,” 
says Kickstarter co-founder Yancey Strickler. 
“Social forces carry a lot of weight.” The sys- 


MASS APPEAL 
How to woo the crowd 


The owners of crowd-funding sites give 
their tips on pitching winning scientific 
proposals. 

@ Create a compelling story about your 
research. Who will it benefit? And how? 
Then tell that story to camera — many 
sites allow project owners to upload 
short videos as part of the pitch. 

@ Devise clever rewards for donors. Think 
about giving away T-shirts decorated 
with project logos or, for big donors, 

a chance to visit your lab. Most sites 
require project owners to offer some 
reward, but bear in mind the time 

and expense required to produce and 
distribute whatever you offer. 

@ Use your social network, online and 
offline. Tell friends about the project, and 
ask them to tell their friends. Tweet it, 
blog it, publicize it on Facebook. 

@ Study previous successful pitches. Talk 
to the researchers behind them. Learn 
what works and incorporate it into your 
pitch. J.6. 


tem also puts a premium on inventive, well- 
thought-out proposals. A poorly conceived 
pitch that attracts no funds will do nothing 
for a scientist’s career; nor will one that never 
delivers on its promises. “It may not be formal 
peer review, but crowd-funding has valida- 
tion based on common trust,” says Meece. 
“It’s a pretty heavy filter” Even Sally Rockey, 
deputy director for extramural research at the 
US National Institutes of Health in Bethesda, 
Maryland, sees benefits in an alternative evalu- 
ation system if it helps organizations to achieve 
their research goals. Peer review “is not the 
only model’, she says. 
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Some sites are trying to enhance this informal 
review process. FundaGeek has a discussion 
forum, the ‘Geek Lounge’, where potential 
donors are encouraged to debate the merits of 
a proposal. Last August, the equivalent forum 
on Kickstarter helped to halt one questionable 
project. The proposal, for a product called the 
Tech-Sync Power System, aimed to develop a 
smartphone app that controls home lighting. 
It attracted more than $27,000 in pledges, but 
Kickstarter users with electronics knowledge 
started to question the viability of the system. 
The project owner, who could not be reached 
for comment, eventually deleted his proposal 
as the criticism mounted, and none of the 
donors lost their money. 


THE HARD SELL 

Another objection to crowd-funding may be 
harder to shake. To sell a project, researchers 
need an attention-grabbing story (see ‘How 
to woo the crowd’). That is easy to construct 
if your subject of study is, say, saving pandas 
or curing cancer. It is less so for researchers 
working on polymers. So will crowd-funding 
prove profitable only for ‘sexy’ science? 

Fries concedes that crowd-funding inherently 
favours certain types of project, particularly 
those in applied research. He is an optimist, say- 
ing that if the approach takes off, conventional 
funding agencies will simply have to compen- 
sate by upping their support for basic science. 

Ranganathan, an enthusiastic communica- 
tor who runs his own podcast, bristles at the 
suggestion that crowd-funding will create a 
two-tier system. “Tt’s all about telling a compel- 
ling story about the research,” he says. “Panda 
researchers start ahead, but I 100% believe 
anyone can do it.” A polymer chemist might, 
for example, focus on new materials that could 
come out of his or her work. 

The pressure to communicate the potential 
fruits of a research project should not be seen 
as a burden, adds Ranganathan. Most crowd- 
funding sites expect project leaders to offer 
donors something in exchange for their con- 
tribution, such as regular updates on the pro- 
gress of the research. For those who make larger 
donations there might be visits to a lab or field 
site. In the case of the Mississippi water-quality 
study, donors in the region were encouraged 
to help with collecting water samples from the 
river. This process should help to forge stronger 
bonds between researchers and the public. 

Whether all this works in wider practice 
remains to be seen, but many welcome the 
experiment. “Science thrives on diversity,’ says 
Jack Stilgoe, who studies science and society at 
the University of Exeter, UK. “We shouldn't be 
afraid of innovations in how it is funded. We 
should be more afraid when research money is 
all getting spent in the same way on the same 
sorts of things.” m SEE EDITORIAL P.238 AND COMMENT P.260 


Jim Giles is a freelance writer based in San 
Francisco, California. 
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sking someone to give you a 
million dollars is not easy. In 1995, 
Bruce Walker was seeking philan- 
thropic support to expand his HIV 
research programme at Massachusetts General 
Hospital in Boston. He had identified a possible 
donor: a venture capitalist and the brother of 
one of his patients. He had teamed up witha 
personal coach in fund-raising and rehearsed 
his pitch over and over. But during lunch with 
the prospective donor, he still nearly choked. “I 
couldn't quite get it out to say, “Would you give 
us a million dollars?’,” Walker recounts. 
Eventually he spat out his request — and was 
astonished when the venture capitalist agreed 
to the entire sum. “At that point, I fell off my 
chair,’ Walker says. He then raised another 
US$100 million from philanthropists Terry 
and Susan Ragon to launch the Ragon Institute 
of MGH, MIT and Harvard in Charlestown, 
Massachusetts, in 2009. He and his colleagues 
are now planning a formal seminar series on 
raising philanthropic support. His success at 
fund-raising, he says, “is not because there's 
something special about me. It’s because I got 
alittle bit of training and I put in a lot of effort” 
Walker is not the only one with a determined 
approach to philanthropy. As public funding 
for research dwindles, 


scientists are increasingly NATURE.COM 
seeking private benefac- Read aComment 
tors (see page 252). But onthe dangers of 
they havealottolearnif _ philanthropy: 
theyaretowintrust and — go.nature.com/te4dz8 


“Sponsor 


MY SClenCce 


Philanthropists will sometimes give large sums 
of money to support science — but researchers 
have to learn how to sell themselves first. 


BY HEIDI LEDFORD 


money: how to schmooze contacts, promote 
their science and deliver results to deadline — all 
without over-promising on the work (see ‘How 
to woo philanthropists’). Much of this does not 
come naturally to scientists. “You have to sell 
yourself? says Cheryl McEwen, who, with her 
husband Rob, donated US$10 million to found 
the McEwen Centre for Regenerative Medicine 
in Toronto, in 2003. “But if you can build a case 
that we can understand, we'll be there for you.” 
To help build that case, research institutions 
— particularly those in the United States — are 
becoming savvier about how to approach phi- 
lanthropists. In April 2010, Steve Rum, the vice- 
president for development and alumni relations 
at Johns Hopkins School of Medicine in Balti- 
more, Maryland, started training faculty at the 
school in fund-raising techniques. Elsewhere, 
scientists are enrolling in classes provided by 
external institutes. Researchers are happy to 
have the help, and not just because it can bring 
in new income. It can also allow them to pursue 
projects that government funding committees 
would find too risky to support. “When you 
come to the government agencies with their 
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committees and extreme accountability and 
taxpayer dollars, everything tends to come out 
a bit plain vanilla,” says Tim Hunt, a Nobel- 
prizewinning biochemist at Cancer Research 
UK in London, who credits philanthropic sup- 
port with helping him to continue his basic cell- 
biology studies even when applied research was 
more in vogue. “It’s valuable to have funding 
that’s a little bit out of the mainstream” 


PERSONAL APPROACH 
Philanthropy is supporting a growing slice of 
science: donations from US foundations to sci- 
ence, technology and medical research have 
grown from $793 million in 1999 to $1.7 billion 
in 2010, according to figures from the Founda- 
tion Center, an organization based in New York 
that analyses information about philanthropy. 
But although many scientists have experience in 
sending grant applications to foundations, few 
try to win multimillion-dollar donations from 
rich and generous individuals. 

Convincing a philanthropist to fund a 
programme means cultivating a personal rela- 
tionship with them. “It’s an old adage, but it’s 


2&3 ILLUSTRATION 


completely true: people give money to people,” 
says Thomas Pierson, chief executive of the 
SETI Institute, an astrobiology research cen- 
tre in Mountain View, California. After SETI 
was cut loose from NASA in 1993, it relied on 
Bernard Oliver, a well-connected member of 
its board and a former head of research and 
development at the computer firm Hewlett 
Packard in Palo Alto, California, to take its case 
to local entrepreneurs. The result: support from 
a whos who of technology luminaries, including 
Microsoft co-founder Paul Allen. 

Most scientists will need to look outside their 
usual network of colleagues and supporters for 
prospective donors, says Robert Klein, a real- 
estate magnate who helped to raise $34.5 mil- 
lion in the 2004 campaign for California's 
$3-billion stem-cell initiative. Most of that, he 
says, came from wealthy families with a vested 
interest — usually a sick relative — in seeing 
stem-cell technologies succeed. To reach these 
families, Klein recommends first building con- 
tacts with patients and those who support them. 
“Have a patient advocate call ahead, and sit next 
to you while you make the pitch,” says Klein. 
“Better yet, if you can get them to do it, have the 
advocate make the pitch for you.” 

Walker urges researchers to be on continual 
alert for new contacts. “When you travel, always 
try to get upgraded to business class,” he says. 
“Then talk to the people around you about what 
you do?” 

This type of schmoozing takes time and 
a certain personality. Rum says that Morton 
Goldberg, an ophthalmologist at Johns Hop- 
kins, is a fund-raising “superstar” — helping to 
build an endowment that now funds more than 
30 professorships. The secret to his success, says 
Rum, is his willingness to forge lasting relation- 
ships with philanthropists. Goldberg says he 
considers many of his donors to be close friends. 
He has travelled with them, spending weekends 
on their boats and in their vacation homes. “It 
has been an extremely rewarding experience, far 
beyond any connection with money,’ he says. 

Many physicians are reluctant to solicit 
their former patients, and with good reason, 
says bioethicist Sheldon Krimsky of Tufts 


University in Boston. “It’s not OK for a doctor 
who is actively treating a patient to do it.’ Even 
after a patient has left the hospital, he or she 
may return, says Krimsky, and a physician who 
has received that patient's support may now be 
expected to provide special treatment. 
Another pitfall of philanthropy, Krimsky 
says, is donors who attach strings to their gifts. 
When the Charles G. Koch Charitable Foun- 
dation in Arlington, Virginia, agreed to fund a 
faculty position in economics at Florida State 


University in Tallahassee, it demanded the right 
to determine the 

“6 criteria used to 
Try to ‘Pp | pick a professor, 
and to veto can- 

up grat e didates it did not 
to business _ iike(s.Krimsky 
Nature 474, 129; 


9 
class. 2011). “It was so 
egregious,’ says 
Krimsky. The university accepted the terms in 
2008, but has argued that input from the foun- 
dation during the hiring process has not com- 
promised its academic integrity. 

Krimsky worries that tough economic times 
can make researchers and their institutions 
more willing to accept intrusions into academic 
freedom in exchange for funding. Scientists 
agree, but several who have received philan- 
thropic support point out that they appreciated 
the sound financial and strategic advice the 
donors offered. 

For many scientists, the most difficult step is 
pitching their work as if they were an entrepre- 
neur in need of investment, says Garen Staglin, 
who, with his wife, Shari, donated some of the 
family’s earnings from their vineyard in Napa 
Valley, California, to mental-health research 
after their son was diagnosed with schizophre- 
nia. “You have to go into sell mode,’ he says. 
“You have to be able to say ‘there are no guar- 
antees here but if this works, this is the kind of 
stuff we think is possible in our lifetime.” But 
that type of sell tends to be easier for applied 
projects, such as potentially life-saving medical 
studies, than for basic research. 

Selling research can also require a 


FUND-RAISING 101 


How to woo philanthropists 


Advice from successful fund-raisers on how 
to sell science. 

@ Network outside your usual circles: look 
for contacts — and contacts of contacts 
—who can strike an emotional chord with 
philanthropists. 

@ Learn how to frame your research and its 
importance in a few sentences. Practise this 
‘elevator pitch’ in front of the mirror, and 
with anyone who will listen. 

@ Aim high: look for hints about how mucha 


donor might be willing to commit. 

@ Establish deadlines: break projects 

into small chunks and assign completion 
dates for each step. Explain the vagaries of 
scientific research to your donor and don’t 
be afraid to adjust timelines. 

@ Reward your investor: send regular 
progress updates, acknowledge their 
sponsorship in publications and 
presentations and encourage contributors 
to come to the lab for visits and tours. HL. 
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willingness to set deadlines. “In difficult eco- 
nomic times, the philanthropies become very 
goal oriented,” Klein says. It’s a practice that 
many scientists resist, because science rarely 
goes according to plan. When it doesn't, fund- 
raising specialists say that it's best to acknowl- 
edge the failure and stress the importance of 
the lessons learned. “You will still give them 
the confidence that there are incremental 
improvements,’ says Klein. 


PHILANTHROPY SCHOOL 

Research institutions often rely on professional 
fund-raisers to approach philanthropists, but 
Johns Hopkins isnt alone in training faculty 
members. Advancement Resources in Cedar 
Rapids, Iowa, has seen a fivefold increase in 
demand for its fund-raising training over the 
past four years, says chief executive Joe Gold- 
ing. Institutions pay $17,500 for a four-hour 
workshop of lectures and role-playing exercises. 

Golding says that basic researchers would 
do well to target entrepreneurs. “They are, by 
definition, risk takers,’ he says. “And they value 
leverage.’ Stress that a $30,000 investment now 
could lead toa million-dollar government grant 
later, he advises, and emphasize the impact a 
project would have on a topic that is close to the 
entrepreneur's heart. Golding, Rum and other 
experts also tell their trainees to listen for clues 
to how much a donor might be willing to com- 
mit. “If ask for $100,000 and they say ‘yes’ right 
away, then I didn’t ask for enough,” says one 
fund-raiser at a large research university who 
asked not to be named. “It's a common mistake.” 

Rum even conducted a trial to find the best 
way to teach faculty members about how to 
approach former patients. He divided faculty 
members into three groups. One received only 
weekly fund-raising lessons by e-mail. The sec- 
ond also attended a lecture on the topic. The 
third received individualized coaching, includ- 
ing how to prepare an ‘elevator pitch’ — a brief 
description of research so compelling that it 
could capture someone's attention in the time 
it takes to travel between floors. Three months 
after coaching, the first two groups had brought 
in no money. The third had collected five gifts 
totalling $219,550 (S. Rum and S. M. Wright 
Acad. Med. 87, 55-59; 2012). 

Walker says that this type of personal coach- 
ing — not to mention the agony of asking for 
money — has been more than worthwhile. “I 
think we as a scientific community don't doa 
great job in articulating the transformational 
power of philanthropy,’ he says. “But that fund- 
ing has allowed me to do things that I never 
could have considered if Id been trying to do it 
through traditional sources” 

Too few researchers take advantage of 
philanthropy, says Goldberg. “The most com- 
mon mistake is not to try.’ m SEE EDITORIALP.238 AND 
COMMENT P.260 


Heidi Ledford is a reporter for Nature based 
in Cambridge, Massachusetts. 
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would spur biomedical 


VACCINES More collaboration 
research innovation p.261 


EXHIBITION The scientific 
legacy of Robert Scott’s 
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The fight over flu 


A proposal to restrict the planned publication of 
research on a potentially deadly avian influenza virus is 
causing a furore. Ten experts suggest ways to proceed. 


RON FOUCHIER & 
AB OSTERHAUS 
Globalize the 
discussion 


Erasmus MC, Rotterdam, 
the Netherlands 


So far, most of the human deaths from the 
deadly H5N1 strain of bird flu have occurred 
in Asia and the Middle East. Many labs world- 
wide — including ours — are trying to under- 
stand what makes the virus so virulent, and 
how to stop it. H5N1 research is thus a global 
issue, yet the entire research community seems 
to be following the advice of one country. 

We are not questioning the unprecedented 
recommendations last month from the US 


National Science Advisory Board for Bio- 
security (NSABB) to remove key details 
from the methods and results sections of 
published papers, including our own, sub- 
mitted to Science (see Nature 481, 9-10; 
2012). But we do question whether it is 
appropriate to have one country dominate 
a discussion that has an impact on scientists 
and public-health officials worldwide. This 
discussion should include the perspective of 
people in regions where H5N1 has infected 
humans. Will the NSABB also advise on 
which international researchers and officials 
have the right to see the full papers, to help 
implement urgently needed surveillance and 
other intervention strategies? 

It is not clear whether an international 
discussion would lead to different recom- 
mendations. There is no global equivalent 
of the NSABB, but many European experts 
that we have seen quoted in the press believe 
that the research should be published in full. 


We don't know the worldwide opinion until 
a group of experts from all parts of the globe 
is formed. An issue this big should not be 
decided by one country, but by all of us. 


JOHN STEINBRUNER 
A system for 
redacted papers 


Director, Center for International 
and Security Studies at Maryland, 
University of Maryland, College Park 


If the two papers (submitted to Nature and 
Science) describing a transmissible form of 
the H5N1 virus are the first to be published 
with key details missing, they probably won't 
be the last. We need to establish both a short- 
term and a long-term solution for how the 
scientific community should handle such a 
publication. Who decides who should have 
access to the full details? Who monitors 
the community so that the details don’t get 
passed around outside the group of experts 
cleared to receive them? 

I believe that the entire process must be 
regulated by a global health body, ideally 
the World Health Organization (WHO). 
Already, a WHO committee oversees all 
research involving the smallpox virus. A 
similar, more developed system could work 
for H5N1 and other deadly pathogens. 
An international group of experts would 
approve research involving those agents, 
decide who will have access to the details 
of papers that come out of that work, and 
hold those vetted individuals accountable 
for what they do with the information. For 
instance, sucha system might allow permit- 
ted experts to view papers only electroni- 
cally, so nothing is on paper. A database 
could record the privileged few people who 
have seen the full paper, and what they do 
with the information. In all likelihood, legal 
safeguards would be needed to protect the 
rights of people who receive the sensitive 
information. Sucha process would not offer 
complete protection against misuse, but it 
would show the scientific community that 
the committee is watching what they do. 

Because dangerous pathogens are a global 
issue, any procedure would need buy-in > 
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> from all countries, who would have to 
give the committee binding jurisdiction 
over research involving extremely dangerous 
agents. This will take some time. In the short 
term, the WHO or some other global health 
organization should immediately establish 
an ad hoc committee to review who should 
receive access to the full H5N1 papers, and 
ensure that the details do not circulate widely. 
Most importantly, these discussions 
should not be controlled by officials focused 
on national security. H5N1 is primarily a 
matter of public health. If there is a threat 
of bioterrorism, let it be judged by a global 
health organization, which can set rules that 
do not deprive scientists of information that 
could save millions of people in the case of 
a natural pandemic. If national-security 
organizations become involved, they will 
vet scientists on the basis of citizenship, and 
will be inclined to discriminate against those 
countries in which terrorists have found ref- 
uge. But some of those countries are among 
the few that have experienced human deaths 
from H5N1, and are most likely to witness 
the origins of a natural pandemic. It is cru- 
cial that scientists and other experts are 
judged on their qualifications, not on their 
nationality. If the world is to accept the pro- 
cess, national-security officials cannot be 
allowed to dominate the discussion. 


KWOK-YUNG YUEN 
The Hong Kong 
perspective 


Chair of Infectious Disease, 
Department of Microbiology, 
University of Hong Kong 


Asa scientist working in Hong Kong — the 
site of the first human epidemic of infection 
by the highly fatal H5N1 virus — I appreci- 
ate the public-health significance of knowing 
which mutations confer airborne transmis- 
sibility in an animal model. The new, much- 
debated research provides this information. 
Finding similar genomic signatures in animal 
or human viruses collected from the WHO 
Global Influenza Surveillance Network may 
alert public-health workers to an impending 
epidemic of unthinkable magnitude or sever- 
ity. But I also appreciate the possibility that 
such mutants could cause a global disaster 
if accidentally or deliberately produced and 
released into animal and human populations. 
Consequently, I support the recommenda- 
tion from the NSABB to remove key details 
from the papers describing this work. 
Biological warfare is familiar to people 
living in this part of the world. During the 
Sino-Japanese War in the 1930s and 1940s, 
scientists and physicians of the Japanese 


army’s Unit 731 infected civilians and prison- 
ers of war in Manchuria, China, with Yersinia 
pestis, the bacterium behind the Black Death, 
which was first identified in plague patients 
in Hong Kong in 1894. When the unit found 
that the bacteria caused severe organ dam- 
age when serially inoculated into prisoners 
of war, the unit began spreading plague-con- 
taminated fleas in China, causing outbreaks. 

Censoring scientific data for publication 
will not stop rogue individuals or nations 
from developing a deadly and highly trans- 
missible form of H5N1, but it would at least 
buy some time to find and stockpile the 
appropriate antivirals, immunomodulators 
and vaccines to protect against most variants 
of H5N1. Even if the publications omit the 
methods for making such a deadly virus, 
the genomic signatures associated with air- 
borne transmissibility should be known to 
the directors of all public-health laboratories 
in the WHO surveillance network, after they 
sign an agreement of confidentiality. 


D. A. HENDERSON 
The ultimate 
biological threat 


Center for Biosecurity, University 
of Pittsburgh Medical Center, 
Pennsylvania 


The H5N1 influenza strain poses a poten- 
tial biological hazard far more serious than 
any we have ever known. It is a virus that is 
capable of killing half its victims, a propor- 
tion greater than that for any other epidemic 
disease. Were that coupled with the trans- 
missibility of a pandemic flu virus, it would 
have characteristics of an ultimate biologi- 
cal weapon unknown even in science fiction 
(see Nature 480, 421-422; 2011). We should 
not publish a blueprint for constructing such 
an organism. 
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South Asian countries such as Nepal are among the most likely to be hit by avian flu epidemics. 


Sy 


LYNN KLOTZ & 

ED SYLVESTER 
Worry about 
lab infections 


Center for Arms Control and 
Non-Proliferation, Washington DC; 
Walter Cronkite School of Journalism 
at Arizona State University 


Asian bird flu is just one of the extremely 
dangerous pathogens researched in labo- 
ratories throughout the world. Along with 
the two labs that created a potentially con- 
tagious form of H5N1, at least 40 others 
worldwide investigate deadly, highly con- 
tagious pathogens not currently present in 
human populations, such as the SARS virus 
and the recently resurrected 1918 pandemic 
flu virus. Public-health experts worry about 
natural pandemics, and governments worry 
about the risk that these pathogens pose 
to national security — but the probability 
of accidental release is likely to be much 
higher. 

We have analysed the likelihood of 
escape from 42 labs, using 1% as the esti- 
mated probability of an escape from a single 
lab in a single year. This approximates the 
historical probability, obtained by divid- 
ing the documented number of escapes of 
these pathogens (3, each involving the SARS 
virus) by our estimate of the total number 
of lab-years of research on these pathogens 
since 2003 (more than 300 lab years). Lab 
infections can easily spread: in 2004, after 
the only natural SARS outbreak in humans 
was contained, two graduate students lab- 
infected with SARS in Beijing infected seven 
others, causing one death. 

Our analysis shows that the probability of 
an escape from at least one of 42 labs ina 
single year is 34%; within less than 4 years, 


N. SHRESTHA/EPA/CORBIS 


the chance of escape reaches 80%. If more 
labs begin studying the (potentially) human 
contagious form of H5N1, that interval will 
decrease even further. 

In comparison, the average time between 
the three natural influenza pandemics in 
the last century was about 30 years (1918, 
1957 and 1968). We are creating a risk that 
is much greater than that posed by nature. 
Laboratories need stronger precautions, such 
as mandatory quarantine of lab staff work- 
ing on live dangerous pathogens until they 
can be certified clear of infection. Regulators 
should not be sitting idly by, while the threat 
of a man-made pandemic looms. 


JEFFERY K. TAUBENBERGER 
Study how viruses 
swap hosts 


Chief, Viral Pathogenesis and Evolution 
Section, Laboratory of Infectious 
Diseases, US National Institute of 
Allergy and Infectious Diseases 


How influenza viruses adapt to humans and 
cause disease has been a significant inter- 
est of my laboratory, including sequencing 
the genome of the 1918 H1N1 influenza, 
the virus that caused the worst pandemic 
on record. This work has led to research on 
how influenza viruses switch hosts and cause 
severe disease, and is serving as a basis for new 
vaccines and therapies. As a result of these 
studies, scientists and public-health officials 
are better prepared to prevent or mitigate a 
similar future pandemic. Studies examining 
adaptation and transmission of avian HSN1 
influenza virus are similarly important. 

For reasons not yet clear, some influenza 
viruses adapted to poultry can acquire sets 
of mutations that kill almost every bird that 
they infect, but rarely infect humans. Since 
1997, H5N1 viruses have devastated poul- 
try flocks in southeast Asia and other parts 
of the world, and also caused disease in a 
limited number of the presumably tens of 
thousands of humans who have had direct 
contact with infected birds. Although human 
infections are uncommon, the high fatality 
rate in those people infected by H5N1 — 
more than 50% — raises serious concerns. 

A growing body of evidence suggests 
that influenza host-switching processes are 
complex, and may be unique to each virus. 
Changes in the 1918 H1N1 virus crucial 
for adapting to humans, for example, were 
not found in the 2009 


pandemic HIN1 virus. NATURE.COM 

It is essential to con- See Nature’s web 
tinue research into this _ focus on the 1918 
with H5N1 and other __ influenza pandemic: 
emerging pathogens, __go.tiaftire.com/Srepjv 


and to investigate how they adapt and cause 
disease. The work could provide better sur- 
veillance tools to detect viruses that are in 
the process of adapting to humans. Scien- 
tists must be able to continue to work with 
these viruses safely and appropriately, in 
multiple teams utilizing the talents of many 
researchers, and under the numerous layers 
of protection and oversight that safeguard 
scientists and the public. 

Of course, scientists will have to continu- 
ously re-examine and discuss the balance 
between the benefits of conducting and 
publishing pathogen research, and the risk 
of misuse of such knowledge. We should 
continue the dialogue as we continue the 
research. 


RICHARD H. EBRIGHT 
Mitigate the 
risks of release 


Department of Chemistry and 
Chemical Biology, Rutgers 
University, Piscataway, New Jersey 


Engineered derivatives of highly patho- 
genic avian influenza virus capable of 
aerosol transmission in non-rodent mam- 
mals pose potential threats to human health 
and to food production. The major concern 
is accidental release, for example through 
infection of a lab worker who then infects 
others. Deliberate release by a disturbed or 
disgruntled lab worker, bioterrorism and 
biowarfare are also concerns. 

The following steps should be taken imme- 
diately. To address accidental release, assign 
the viruses as pathogens that require the high- 
est biosafety level (‘level 4; not ‘level 3+’ as has 
been the case to date). To address deliberate 
release, assign the viruses as pathogens that 
require the highest level of biosecurity (“Tier 
1} like smallpox virus and anthrax bacterium, 
under the revised US select-agent rule that 
enters into force this year). 

To minimize risks from future research, 
additional steps should be taken. We need 
to implement a system of mandatory prior 
review of research directed at increasing a 
pathogen’s virulence, transmissibility or abil- 
ity to evade countermeasures. We also need 
to re-evaluate, and preferably terminate, bio- 
defence expenditures on research directed 
at creating and assessing new biothreats, as 
opposed to addressing existing biothreats. 
Creating and assessing new threats rarely 
increases security. Doing so in biology — 
where the number of potential threats is 
nearly infinite, and where the asymmetry 
between the ease of creating threats and 
the difficulty of addressing threats is nearly 
absolute — is especially counterproductive. 


DAVID L. HEYMANN 
We will always 
need vaccines 


Head, Centre on Global Health 
Security, Chatham House, London 


The creation in vitro of highly transmissible 
forms of H5N1 is a fresh reminder of how 
the endgame in infectious-disease control 
has changed. 

For years, scientists and officials have 
believed that the ultimate application of dis- 
ease control was eradication. In a world free 
from a particular pathogen, the disease no 
longer occurs, and the expense — and small 
risk — of vaccination is avoided. As a result, 
there would be no need to develop and 
maintain vaccine stockpiles. This idea began 
to change in 2002, when the poliovirus was 
synthesized de novo. The change has been 
profound — the endgame must now take 
into consideration the possibility that con- 
solidation and safe storage, or destruction, 
of remaining polioviruses is not sufficient to 
eliminate the risk that it will be reintroduced. 
With the possibility of synthesizing a virus, 
it is likely that we will always need to have 
vaccines readily available. 

But some say that maintaining samples 
of deadly pathogens is a necessary precau- 
tion. Take the discussions about the samples 
of smallpox virus (Variola), maintained at 
two WHO-sanctioned laboratories since the 
virus’s eradication. Some now argue that we 
need these samples to study the virus, and to 
develop new vaccines and therapies in case 
of a bioterrorist attack. Indeed, the WHO 
Advisory Group of Independent Experts that 
reviews the smallpox research programme 
noted this year that DNA sequencing, clon- 
ing and gene synthesis could now allow 
de novo synthesis of the entire Variola virus 
genome and creation of a live virus, using 
publicly available sequence information, at 
a cost of about US$200,000 or less. 

It is tempting to argue that, for the sake of 
public health, research that constructs highly 
pathogenic forms of viruses, such as H5N1, 
should be squashed. But even if we elimi- 
nated H5N1 and destroyed all lab samples, 
we would still need to consider keeping and 
producing a vaccine, because the virus could 
potentially be recreated. And stopping this 
type of research would open a dangerous door. 
Although adding and deleting genes can cre- 
ate super-strains that put the entire world at 
risk, such research also helps to develop pub- 
lic-health tools such as vaccines and diagnos- 
tic tests. Preventing this research would also 
prevent us from using all possible scientific 
options to prepare for naturally occurring — 
or deliberately caused — outbreaks. m 
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Philanthropists should pay their fair share of 
research costs, says Patrick Aebischer. 


ith public funding for universities 
diminishing, private and philan- 
thropic sources are increasingly 


being pursued to support academic research. 
Although welcome, this money comes with 
acatch: charities and foundations rarely pay 
the full costs of running a lab — building 
construction, maintenance, utility bills or 
salaries, for instance — and tend to be selec- 
tive in the projects they sponsor. Universities 
therefore need to do more to pass on the true 
costs of research to donors. They must also 
have a balance of income sources to avoid 
bias in their research directions. 

Private foundations have been significant 
sponsors of science since the nineteenth 
century. Until the US National Institutes of 
Health started funding extramural research 
in the mid-1940s, for example, more than 
one-quarter of US medical research was paid 
for by philanthropic organizations. As public 
funding grew, the share contributed by foun- 
dations and charities declined to less than 
4% by 2007 (ref. 1). 

Since then, support from non-profit organ- 
izations — especially those associated with 
rich individuals such as Bill Gates — has risen 
worldwide. In Europe, philanthropic sources 
now supply 6.5% of competitive research 
funding on average — 3-4% in most Euro- 
pean countries and almost 10% in the United 
Kingdom’. At my own institution, the Swiss 
Federal Institute of Technology in Lausanne, 
private sponsorship has tripled, from 3% to 
9% of research income over the past 10 years. 


Biomedicine is benefiting most. And 
nowhere more so than in the United King- 
dom: led by the Wellcome Trust, charities 
accounted for 40% of British biomedical 
research funding in 2009 (ref. 3). 


LEGACY OF URGENCY 
Private institutions, however, have a dispro- 
portionately large influence on the research 
landscape relative to their financial contri- 
bution. Because they can make funding deci- 
sions quickly, their money is welcomed by 
principal investigators looking to fill fund- 
ing gaps. But directed awards can divert an 
entire group’s research towards one end. 
Keen to ensure that results are delivered 
and milestones met, charities’ legacy of 
urgency has taken them away from backing 
long-term research projects and towards sat- 
isfying narrow goals that fit their mission. 
Since the boom in ‘venture philanthropy’ in 
the 1990s, the personal agendas of powerful 
entrepreneurs have accelerated research into 
AIDS; infectious diseases such as malaria 
and tuberculosis; cancer; and neurodegen- 
erative diseases such as Parkinson disease. 
Charities often send the message that not 
one cent invested is spent on anything other 
than finding the cures and so minimize their 
contributions towards a university's overhead 
costs. That means that institutions with many 
privately funded projects are effectively ‘pun- 
ished’ for their success. To meet the higher 
research-infrastructure costs, universi- 
ties may drain resources from education, 
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or diminish ‘expensive’ disciplines such as 
physics, chemistry or engineering, in which 
philanthropic support is scarce. 

Private bodies should not hijack university 
resources. They should contribute a fair share 
of the expense of a sustained research enter- 
prise. To make it easier for them to do so, uni- 
versities should better identify the full cost of 
research activities and pass it on. And because 
most charities operate internationally, these 
overheads should be harmonized worldwide. 


FULL COSTS 

Such an accounting effort is ongoing in some 
countries, especially the United States. But 
the concept of overheads is almost unknown 
in many European countries, where univer- 
sities are largely state supported. Some Euro- 
pean universities are addressing this issue, in 
response to requirements by the European 
granting system to supply full research costs. 

Estimates of overhead costs vary by field 
and by country. In the United States, they 
typically range from 40% to 70% of grant 
income, depending on whether the principal 
investigator's salary is included. But awards 
are usually lower in Europe. For example, in 
2008, the Swiss Parliament agreed to com- 
pensate universities with an overhead of 
approximately 10% of each successful pro- 
ject submitted to the Swiss National Science 
Foundation. Grants from the Seventh Frame- 
work Program and the European Research 
Council support a maximum overhead of 
20%. This is inadequate and should rise. 

Universities can do much to manage their 
varied funding streams. They should develop, 
champion and apply transparent full-cost 
accounting mechanisms. Leaders engaging 
in fund-raising should promote the inclusion 
of overhead costs. And academies must avoid 
propping up underfunded research activities 
with educational resources. 

Ultimately, universities should not rely on 
short-term funding for basics such as infra- 
structure and faculty salaries. They need to 
retain a healthy level of public funding if they 
are to survive long term. Universities have a 
mission of education and of long-term, fun- 
damental research; charities aim to solve 
real-world problems now. Fruitful partner- 
ship should imperil neither but achieve the 
noble goals of both. = 


Patrick Aebischer is president of the Ecole 
Polytechnique Fédérale de Lausanne, 
Lausanne, Switzerland 

e-mail: patrick.aebischer@epfl.ch 
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PUBLIC HEALTH 


Despite threats such as the 2001 US anthrax attacks, vaccine innovation has slowed in the past 40 years. 


Biodefence built 
on teamwork 


The golden era of US vaccine research holds beneficial 
lessons for today, finds John Grabenstein. 


iomedical researchers who are familiar 
B= the US National Institutes of 
Health as a funding body and agenda- 
setter might be surprised by Long Shot. 
Science historian Kendall Hoyt shows that 
there was once a different way to drive medi- 
cal progress. In the 1940s and 1950s, focused 
expertise and goal-oriented collaboration 
between the military, academia and industry 
yielded key vaccine innovations that bolstered 
national security, yet had benefits for civil- 
ians. That era of medical discovery stands in 
marked contrast to today’s patent constraints, 
regulatory hurdles and disjointed efforts. 
The book is timely, with the United States 
and other countries continuing to search 
for pharmacologi- 
cal defences against 
unforeseen biologi- 
cal threats such as 
the 2001 US anthrax 
attacks, which killed 


> NATURE.COM 
For more on civilian 
and military disease 
collaborations: 
go.nature.com/thfn8s 


5 people and infected 17. The current US 
biodefence effort has been expensive — less 
so when compared with the cost of certain 
military aircraft — yet it has not focused 
adequately on developing products that are 
usable. Recalling a time when introducing 
vaccines seemed easier could provide salu- 
tary lessons for today. 

Hoyt begins her deft treatment by describ- 
ing the threats from pathogens and how she 
measures vaccine innovation, then reviews 
vaccine research during and after the Second 
World War. She also discusses recent fac- 
tors that have disrupted vaccine-producing 
government-industrial networks and led to 
contemporary frustrations. 

In the 1940s, vital research aiming to create 
practical products for protecting soldiers 
and nations overwhelmed theoretical stud- 
ies. Most developments were not a response 
to enemy bioweapons, but rather to viral and 
bacterial pathogens in countries where troops 
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were based, or to diseases linked to crowding 
or poor hygiene. For example, influenza A 
and B vaccine was developed in response 
to the 1918-19 pandemic that killed tens of 
millions of people; pneumococcal polysac- 
charide vaccine came about because of the 
crowded conditions in barracks that often led 
to pneumonia; and typhus and tetanus vac- 
cines were produced to combat two common 
diseases faced by troops throughout history. 

As Hoyt shows, the urgent need to find 
vaccine solutions drove highly effective 
collaborations between various military, 
academic and industrial research bodies. 
But that urgency led to stumbles too. When 
production of yellow-fever vaccine was 
scaled up to enable vaccination of US Army 
troops in the 1940s, the vaccine was stabi- 
lized with human serum that was unknow- 
ingly contaminated with hepatitis B virus, 
leading to almost 50,000 hospital cases. 
Formulation changes later allowed the 
vaccine to be used to prevent yellow fever 
across broad swathes of the globe. 

The book’s narrative is enriched by a 
description of how the US Office of Scien- 
tific Research and Development, the Office 
of the US Army Surgeon General and the 
War Research Service conducted vaccine- 
research planning and operations during the 
war. With peace in 1945, wartime scientific 
relationships continued to bear fruit, such as 
vaccines against adenovirus and meningo- 
coccal infections. She lauds collaborations 
throughout the 1960s between the Walter 
Reed Army Institute of Research in Silver 
Spring, Maryland, and vaccine firm Merck, 
Sharp and Dohme in West Point, Pennsylva- 
nia. She describes them as science integrators 
that brought together experts from disciplines 
including medicine, immunology and virol- 
ogy, all focused on common goals. Much of 
their success, she says, is due to that focus and 
to teamwork that avoided over-specialization. 

To measure vaccine innovation between 
1900 and 1999, Hoyt reconstructs historical 
records of annual US vaccine introductions. 
This was a challenge: responsibility for vac- 
cine registration (or licensing) was trans- 
ferred from bureau to bureau. Between the 
establishment of the 
US Hygienic Labora- 
tory in 1902 to that 
of the Center for Bio- 
logics Evaluation and 
Research in 1987, there 
were seven institu- 
tional transitions, and 
multiple variations in 
record-keeping. 

According to Hoyt, 
vaccine innovation 
activity peaked in 
the 1940s at 50 inno- 
vative licences per 
decade, remained at 


Long Shot: 
Vaccines for 
National Defense 
KENDALL HOYT 
Harvard University 
Press: 2012. 320 pp. 
$29.95, £22.95 
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35-40 licences per decade through the 
1960s, and declined steadily to 12 per 
decade in the 1990s. The numbers reflect 
all vaccines developed for endemic and 
exotic pathogens. She convincingly con- 
tests similar analyses. 

Today’s biological revolution cuts both 
ways, offering great advances in medicine 
while providing new means of attack for 
terrorist groups. It is thus short-sighted 
that US biodefence funding is largely 
overseen by congressional committees 
that are oriented towards health rather 
than national security. 

If the 1940-69 model was a worthy 
paradigm, why did we migrate away from 
it? Hoyt attributes the change to several 
factors. Government contracting and 
licensing became mired in bureaucracy, 
and concerns over intellectual property 
curtailed collaborations. Oversight of 
vaccine development across a number of 
bodies became inadequate. The political 
constituency for biodefence issues weak- 
ened and funding for products suffered. 
Arguably, the lack of US government 
prioritization of vaccines was the most 
damaging factor. The result is that now, 
the time it takes to develop a vaccine is 
increasing. A better balance of basic and 
applied research could restore a product- 
oriented focus. 

Government-sponsored research of 
bioweapon countermeasures has gener- 
ated knowledge and publications, but 
faltered in delivering practical results. 
Biodefences cannot be put in place solely 
by accepting or dismissing research 
hypotheses; they require safe and effec- 
tive pharmaceutical products. We need 
the biological equivalent of the defences 
that now protect Hawaii’s Pearl Harbor. 

So what should society and govern- 
ments do next? At this point, the book's 
contributions fade. Hoyt dismisses most 
vaccine stockpiles as unresponsive to 
today’s threats, yet doesn’t indicate any 
need for quick-acting antibody for- 
mulations or therapeutics. Instead, she 
advocates an emphasis on multipurpose 
technologies and ‘platforms, many of 
which are hypothetical. How can we meas- 
ure success using a platform approach? 
How quickly could platforms be trans- 
formed into defences against bioweapons? 

Perhaps Hoyt’s idea is the right interim 
solution for today’s nadir in national will. 
Perhaps later, ifand when it returns, the 
well-coordinated, product-centric devel- 
opment of vaccines can resume. History 
offers us lessons in how to do so. = 


John Grabenstein is a senior medical 
director at Merck Vaccines, West Point, 
Pennsylvania, USA. 

e-mail: john_grabenstein@merck.com 
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Cosmic creation 


Pedro Ferreira explores Alan Lightman’s latest novel — 
a magical-reality take on the origins of the Universe. 


hen a physics heavyweight is 
mentioned in the same breath as 
Salman Rushdie and Italo Calvino, 


itis tough fora reviewer. Few venture into air 
that rarefied and make it out alive. But when 
the bookis Mr g, a creation myth by physicist 
Alan Lightman, it is worth the risk. 

In Mr g, Lightman has taken the core of 
what we know about the origins of the Uni- 
verse from physics, chemistry and biology 
and wrapped a few characters around it. The 
protagonist is the narrator: god, dubbed Mr g. 
Mr g lives in a timeless Void with his Aunt 
Penelope and Uncle Deva, playing out skits 
that could have been lifted from a Woody 
Allen film — but with the humour on mute. 
One day, Mr g wakes up from a nap, decides 
to create a Universe called “Aalam- 104729”, 
and from then on marvels at his creation as 
it evolves and becomes more complex — 
from the beginning of space and time, to the 


2012 
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emergence of funda- 
mental laws, particles, 
forces, stars, galaxies, 
planets and, ultimately, 
life itself. 

As sentient beings 
finally emerge out 
of the cosmic mess, 
Mr gis torn: should he 
intervene or let them 
go their own way? 
Throughout, he is 
taunted by the creepy 
Belhor, a devilish char- 
acter (a fine role for Al 
Pacino if this were ever made into a film) and 
Belhor’s annoying daemons, the Baphomet 
siblings. Belhor pushes Mr g to allow his crea- 
tions to do their own thing, and watches with 
glee as evil and unhappiness begin to emerge 
— leaving Mr g to observe as, for instance, 


Mr g: A Novel 
About the Creation 


Pantheon/Corsair: 
2012. 224 pp. 
$24.95/£9.99 
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an impoverished young woman anguishes 
over stealing meat to feed her starving sib- 
lings. Lightman uses the exchanges between 
Mr gand Belhor to riff on good and evil, free 
will and relative morality. 

Lightman’s grasp of the science, in all its 
gory detail, is unerring. His ability to inter- 
weave the fantastical with the factual is 
impressive — not surprising, given his back- 
ground. In the 1970s, Lightman established 
himself as an astrophysical relativist, writing 
several important papers and books. I still 
use them. In 1993, he published a magical 
collection of short stories, Einsteins Dreams 
(Pantheon), in which he took the strange con- 
cepts and consequences of Einstein's principle 
of relativity and wove them into vignettes. I 
was shocked at how well it worked — and I 
was not alone. The book has been translated 
into 30 languages and has led to several stage 
productions around the world. 

Lightman had done something that I had 
thought impossible: he had brought in the 
hard science and softened it up. Einstein's 
Dreams is sensual; it breathes. Mr g is dif- 
ferent. Much more abstract and almost 
pedagogical, it is a detailed description of the 
birth and evolution of the Universe that reads 
like a Rushdiesque fable about an invented 
place and time. The facts are faultless. Yet the 
book can occasionally be unintelligible. 

For example, Lightman defines a tick ofa 
clock in terms ofa particular frequency of the 
hydrogen atom. Beautifully precise and clear. 
But asa result, any other timescales that he 
mentions must be written in scientific nota- 
tion. And, of course, other big numbers — 
such as the number of neurons in a brain or 
of Universes in what he calls the Void — can 
also be accurately presented only in scientific 
notation. You begin to wonder about the 
readership. Are these nuggets meant to be 
seen as just icons, like hieroglyphs? Or is the 
generalist supposed to know what they mean? 

It has to be said too that Lightman’s work 
could have worked well as a short story. It 
loses steam early on and sometimes feels like 
interspersed vignettes on science and moral- 
ity with a dose of ‘magic thrown in. Finely 
crafted it may be, but it can be hard going. 
Throughout, I hada pretty good idea of what 
was going to happen and, given the nature 
of the story, the characters didn't need to be 
developed. There was nothing to wait for. 

Yet ultimately, this is a marvellous counter- 
point to all of the other nonsense out there on 
creation. Lightman writes exquisitely, so this 
fable on the origin of space, time, matter and 
life is a wordfest that is securely pinned to the 
rational — making him a ‘magic realist’ ofa 
refreshingly different stripe. = 


Pedro Ferreira is professor of astrophysics 
at the University of Oxford, UK, and a 
member of the Oxford Martin School. 
e-mail: p.ferreiral@physics.ox.ac.uk 


Books in brief 


The Green Paradox: A Supply-Side Approach to Global Warming 
Hans-Werner Sinn MIT PRESS 288 pp. $29.95 (2012) 

Frustrated by “counterproductive” climate policies, economist 
Hans-Werner Sinn proposes a radical alternative. Policy-makers, he 
notes, have ignored the oil sheikhs and coal barons who supply the 
fossil-fuel market. Yet it is they who call the shots, as shown by the 
‘green paradox’ — announcements of future reductions in carbon 
consumption that drive carbon-resource controllers to bump up 
production. Sinn’s antidote to the ideology that plagues policy- 
making is a “Super-Kyoto” system: unified countries, coordinated 
caps and trade, and taxation designed to curb the ‘extraction habit’. 


( — The Happiness of Pursuit: What Neuroscience Can Teach Us About 
| the Good Life 

Shimon Edelman Basic Books 256 pp. $25.99 (2012) 

The mind, says cognitive psychologist Shimon Edelman, is a 

literal “meat computer”. Our experience of the world is a series of 
computations carried out by neural wetware. But where does that 
leave philosophical conundrums such as joy? Taking passages by 
luminaries including Homer, William Shakespeare and Jorge Luis 
Borges as touchstones, Edelman powers along on his “quest for an 
algorithmic understanding of happiness”, revealing that it is this 
computational journey itself that constitutes the good life. 


How Not to Be Eaten: The Insects Fight Back 

Gilbert Waldbauer UNIVERSITY OF CALIFORNIA PRESS 240 pp. 

$27.95 (2012) 

Insects and their predators use a vast, often bizarre array of 
strategies to eat or to avoid being eaten. Entomologist Gilbert 
Waldbauer tours tactics on both sides, from sticky lures to kamikaze 
speed. His fascinating cast ranges from goatsuckers (birds in the 
order Caprimulgiformes) that can catch massive moths in their 
bristled gapes, to the brilliantly hued lubber grasshopper (Romalea 
y guttata), which, when threatened, both vomits a noxious substance 
and hisses as stinking froth erupts from its thorax. 


Knowing Nature: Art and Science in Philadelphia, 1740-1840 
Edited by Amy R. W. Meyers and Lisa L. Ford YALE UNIVERSITY PRESS 
432 pp. $65 (2012) 

When writer Thomas Paine saw Philadelphia in 1774, he noted that 
nearly every citizen had “some scientific interest or business” — a 
state that persisted to the mid-nineteenth century. The city boasted 
artist-naturalists such as John James Audubon, and the study of 
nature filtered through all strata of society. These 14 illustrated 
essays, edited by art historians Amy Meyers and Lisa Ford, trace a 
century of influential scientific endeavour, from botanic gardens and 
cabinets of curiosity to pioneering colour-plate techniques. 


Vegetables: A Biography 


> Evelyne Bloch-Dano (translated by Teresa Lavender Fagan) UNIVERSITY 
& OF CHICAGO PRESS 128 pp. $20 (2012) 
Next time you chop a carrot, spare a thought for the centuries of 
YEERTABL ES | breeding, farming technology and trade that brought it to your 

=] kitchen. In brief ‘biographies’ of 11 vegetables, from pumpkins and 

“| artichokes to chard, writer Evelyne Bloch-Dano serves up a feast of 
8 associated genetics, agriculture, history and culture. From the role of 
- yy chillies in world trade to the strange link between peas and the court 


4 of Versailles in France, there is much to savour. 
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Ri. 4 


Biologist Edward Atkinson in the lab at Cape Evans hut, base camp of the Antarctic Terra Nova expedition. 


ANTARCTICA 


Scientists to the end 


Colin Martin celebrates a London exhibition revealing 
the research legacy of Robert Scott’s final journey. 


in polar history. One hundred years ago 
this week, British Antarctic Expedition 
leader Robert Falcon Scott and his four 
companions reached the spot where, 33 days 
previously, Roald Amundsen and his party 
had planted a Norwegian flag. All five mem- 
bers of Scott’s party died while attempting 
to return to their base camp at Cape Evans. 
That tragic outcome has tended to 
overshadow the expedition’s scientific 
achievements. In Scott’s Last Expedition, 
London’s Natural History Museum aims 
to redress the balance. As curator Elin 
Simonsson explains, “We want people to 
know that Scott's British Antarctic Expedi- 
tion was not simply a quest to reach the South 
Pole, but an important scientific expedition 
that carried out work across many fields.” 
Those fields included glaciology (then in its 
infancy), biology, magnetism, meteorology, 
chemistry and physics. The scientific team 
that participated in Scott's Terra Nova expe- 
dition of 1910-13 numbered 12 — the largest 
sent to the Antarctic at that time — including 
zoologist Edward Wilson, geologist T. Griffith 


[: is one of the most poignant moments 


Taylor and meteorolo- 

gist George Simpson. ONATURE.COM 
When the ship Terra _ For more on Scott's 

Nova returned to Eng- __ legacy: 

land, it wasladen with —_go.nature.com/Sfhuuj 


Scott’s Last Expedition 
Natural History Museum, London. 
20 January to 2 September 2012. 


These Rough Notes 
Scott Polar Research Institute, Cambridge, UK. 
Until 5 May 2012. 


specimens of 2,109 animals, plants and fossils 
— with more than 401 new to science. 
Geological specimens (1,919 of 
which are in the museum’s 
permanent collection) helped 
to construct knowledge of the 
continent. The meteorological 
data — the longest unbroken 
record of weather collected in the 
early twentieth century — pro- 
vide baselines that are now used 

in assessing climate change. 

The shore party of 25 men 
lived and worked in a wooden 
hut measuring 112 square metres 
(about half the area of a singles 
tennis court). Although the hut 
was equipped with scientific instru- 
ments, the scientists’ real lab was the 
Antarctic. The exhibition includes 
a full-size representation of the hut 
(the original is preserved at Cape Evans), 
with actual artefacts from the expedition on 
display, including scientific apparatus and 
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notebooks as well as a range of biological 
and geological specimens. 

These include the shells of three eggs from 
the emperor penguin (Aptenodytes forsteri; 
pictured), which contained embryos at 
different stages of development. Wilson, 
his assistant Apsley Cherry-Garrard and 
marine lieutenant Henry Bowers collected 
them under unimaginably harsh conditions, 
trudging to the penguin breeding colony 
at Cape Crozier — a trip immortalized by 
Cherry-Garrard in his 1922 book The Worst 
Journey in the World. The team’s painstak- 
ing preservation and study of the embryos 
ultimately came to little, as theories about 
bird evolution from reptiles had changed by 
the time their report was published in 1934. 

However, the fossils of the extinct plant 
Glossopteris indica, found by Scott and his 
final companions, provided important evi- 
dence that the climate of Antarctica once 
supported vegetation and was a part of the 
supercontinent Gondwana. 

The expedition is also documented by 
photographs, many by photographer Her- 
bert Ponting. These include an iconic image 
of Scott writing in his diary, and parasitic- 
infection specialist Edward Atkinson work- 
ing at the bench. Similarly informative are 
Scott's own ‘lost’ photographs. Only a hand- 
ful of the 120 images he took under Ponting’s 
tutelage had been available until last year, 
when Wilson’s great-nephew, polar historian 
David M. Wilson, published The Lost Photo- 
graphs of Captain Scott (Little, Brown; 2011). 
The snowy panoramas and shots of expedi- 
tion life help to reveal the daily realities ofhis 
last endeavour, while polar scientists still use 
the images for comparison with contempo- 
rary photographs to assess climate impacts. 

A concurrent exhibition, These Rough 
Notes, is showing at the Scott Polar Research 
Institute in Cambridge, UK. It takes its title 
from a journal entry written by Scott shortly 
before his death — “These rough notes and 
our dead bodies must tell the tale...” — 
and displays personal papers of Scott 

and other expedition members. They 
record the heartbreaking poignancy 
of the polar party’s death. But as 
the exhibition at the Natural His- 
tory Museum shows, we need to 
reinforce the dual importance of 
this extraordinary expedition. “We 
want the scientific work to make 
the bagging of the Pole merely an 
item in the results,’ wrote Wilson at 
the expedition’s outset. He didnt, of 
course, foresee that his own fate, and 
that of Scott and other colleagues, 
would long eclipse an enduring 
scientific legacy that led to the 

publication of 81 papers. m 
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Rare day to highlight 
rare diseases 


In the last leap year in 2008, 

29 February was chosen to 
mark Rare Disease Day by 
EURORDIS, the European 
Organisation for Rare Diseases. 
This year, this rare day will 
again serve to raise awareness of 
the plight of patients with rare 
disorders. 

Rare diseases fail to attract 
commercial research efforts 
because of the small number of 
people affected (fewer than 5 in 
10,000). Examples include the 
nerve disorder Charcot—Marie- 
Tooth disease and Proteus 
syndrome, which causes excess 
overgrowth of tissues. 

As aresult, only a handful 
of scientists, often working in 
isolation, dedicate their research 
to a particular rare disease. But 
such select studies can deliver 
groundbreaking insights into 
more general disease processes. 
Exposure of molecular 
mechanisms underlying rare 
diseases therefore has the 
potential to help many other 
patients. 

Also on 29 February, the 
first international congress 
dedicated to rare diseases will 
be held in Basel, Switzerland 
(www.react-congress.org). 

It will highlight the recently 
established International Rare 
Diseases Research Consortium 
(IRDiRC), which plans to 
coordinate international 
policy-making with results 
from national research 
projects. These initiatives 
should eventually translate 
scientific advances into benefits 
for patients. 

Susan M. Gasser Friedrich 
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Research, Basel, Switzerland. 
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Asian medicine: 
small species at risk 


The demands of traditional 
Asian medicine (TAM) don't 
just pose a threat to the survival 
of tigers and rhinos (Nature 480, 
$101-S103; 2011). Numerous 
smaller species are also at risk, 
as a result of being traded in 
large volumes. 

For example, millions of 
dried seahorses (Hippocampus 
spp.) and Tokay geckos (Gekko 
gecko) are sold annually for 
use against impotence and 
circulatory problems in TAM. 
Gecko sales have been further 
fuelled by rumours that larger 
individuals can cure HIV. 
Trade in dried orchid parts for 
eye ailments and in ginseng 
rhizomes is consuming plants 
in their millions. The decline in 
Asias turtle population is being 
aggravated by the marketing 
of TAMs that contain turtle 
plastrons (the flat underbelly of 
the shell) to treat skin diseases. 

The Convention on 
International Trade in 
Endangered Species seeks to 
protect most of these globally 
threatened species. The high 
demand for scaly anteaters, 
or pangolins (Manis spp.), 
for example, has led to an 
international trade ban. 
However, the scales of tens of 
thousands of pangolins are still 
confiscated annually in Asia. 

For some species, captive 
breeding and plant propagation 
have been successful, but the 
demand for wild-sourced 
ingredients means that such 
techniques cannot wholly 
mitigate the impact of trade. 

The killing of endangered 
tigers and rhinos for TAM 
will not go unnoticed. But the 
insidious destruction of smaller 
species must also be halted by 
enforcing stricter regulations. 
Vincent Nijman, K. Anne- 
Isole Nekaris Oxford Brookes 
University, Oxford, UK. 
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Call to split fisheries 
at home and abroad 


I agree that “no one should doubt 
that our seas need protection” 
(Nature 480, 151; 2011) and that 
establishing marine protected 
areas alone will not do the job 
(Nature 480, 14-15; 2011).A 
combination of measures is 
needed, including the elimination 
of overfishing subsidies. These 
incentives were introduced when 
fisheries seemed inexhaustible, 
but they inflate profitability and 
drive fishing beyond economic or 
sustainable levels. 

The global community 
mandated the World Trade 
Organization (WTO) to discipline 
overfishing subsidies more than 
ten years ago, but the issues were 
still unresolved at their meeting 
last month. One reason is that 
WTO negotiators are trying 
to broker an all-inclusive deal 
that encompasses domestic and 
international, small- and large- 
scale fisheries. But this approach is 
hindered by national interests. 

The answer is to split the 
world’s fisheries into domestic 
and international ones. 
Domestic fisheries would 
operate within a country’s 
economic exclusion zone and 
target fish stocks that spend all 
their lives there. This split is 
necessary because the incentives 
to eliminate overfishing 
subsidies differ according to 
whether a fishery is domestic or 
international. 

For a domestic fishery, the 
heavy lifting should be on the 
home front; for an international 
fishery, global coordination 
would be needed, because 
unilateral action by one country 
will not eliminate overfishing. 
Categorizing fisheries in this way 
would make it easier to identify 
leverage points for eliminating 
overfishing subsidies. 

U. Rashid Sumaila University 


of British Columbia, Vancouver, 
British Columbia, Canada. 
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Investors unfazed by 
drug-patent expiry 


Your data on changing stock 
prices for five pharmaceutical 
companies from 1997 to 2010 
(Nature 480, 16-17; 2011) 
erroneously indicate a 39% drop 
in aggregated share value, when 
in fact it would have risen by 82% 
(see Correction, Nature 480, 425; 
2011). The error was due mainly 
to the selection of unadjusted, 
rather than adjusted, stock- 
market closing prices. 

The corrected data set indicates 
that the aggregated share prices 
rose roughly in line with the 
Dow Jones Industry Average. 
Considering that these companies 
have the biggest patent cliffs in 
the industry, this result is very 
positive; moreover, it argues 
against your implication that the 
impending expiry of their patents 
is affecting investor confidence in 
pharmaceutical stocks. 

Investors accept that the 
strength of a company’s patent 
portfolio is only one of several 
factors. Hence, despite the 
impending expiry of patents for 
blockbuster products, including 
Pfizer's Lipitor, the performance 
of the five worst-hit companies 
was comparable with that of other 
sectors for the period 1997-2010. 

Other factors such as 
decreasing revenues can also 
affect investor confidence. Your 
simplistic view that the patent 
cliffis the only contributor does 
an injustice to the efforts of 
pharmaceutical companies and 
governments, and unnecessarily 
perpetuates a false uncertainty. 
David Brindley University College 
London, UK; Harvard Stem Cell 
Institute and Harvard Business 
School, Massachusetts, USA. 
david_brindley@harvard.edu 
Brock Reeve The Harvard 
Stem Cell Institute, Cambridge, 
Massachusetts, USA. 
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Paul Mead Doty 


(1920-2011) 


Chemist and arms-control advocate who unravelled the structures of biomolecules. 


failure at home on 5 December 2011 
at the age of 91, leaves three legacies: 
his revelation of the structures of numerous 
proteins and nucleic acids; his contribu- 
tion to our very survival through his tireless 
activity on behalf of arms control and disar- 
mament; and the many successful scientists 
and policy figures whom he mentored. 
Doty was born on 1 June 1920 in 
Charleston, West Virginia, and was 
schooled in Pennsylvania. His excep- 
tional talent was recognized at high 
school, where a chemistry teacher 
asked him to take over the class. Doty 
earned a Bachelor's degree in chemistry 
at Pennsylvania State University (then 
Pennsylvania State College) in Univer- 
sity Parkin 1941 and a PhD in chemistry 
at Columbia University in New Yorkin 
1944. He was renowned for being late. 
On the first day of classes at Columbia, as 
the students were seated in alphabetical 
order, he missed his place. So Doty sat at 
the end, and thereby made the acquaint- 
ance of Bruno Zimm, a friendship that 
lasted a lifetime and would lead to 
seminal scientific collaborations. 
During his doctorate, Doty measured 
the electron affinity of bromine. He 
moved into the new field of polymer sci- 
ence and, with Bruno Zimm, pioneered 
the theory and use of light scattering to 
characterize polymer molecules in solu- 
tion. A postdoctoral position at the Poly- 
technic Institute of Brooklyn, New York, 
brought Doty into contact with found- 
ers of polymer science, such as Herman 
Mark, and alerted him to the unusual 
chemistry and physics of macromolecules. 
During a second postdoc, at the University 
of Cambridge, UK, Doty learned, mainly 
from Max Perutz, about the X-ray diffrac- 
tion of protein crystals, which allowed their 
structures to be determined. 


p= Mead Doty, who died of heart 


BIOLOGICAL MACROMOLECULES 
After briefly joining the faculty at Notre 
Dame University, Indiana, Doty accepted an 
assistant professorship at Harvard Univer- 
sity in Cambridge, Massachusetts, in 1948. 
He remained at Harvard for the rest of his 
life. During his tenure there, he founded the 
Department of Biochemistry and Molecu- 
lar Biology and the Center for Science and 
International Affairs. 

Doty’s research focus shifted at Harvard 


to large biological molecules. Realizing 
that proteins could not be random poly- 
mers, because they could be crystallized 
and had unique structures, he established 
a programme to study synthetic polypep- 
tides. These chains of amino acids allow 
structural transitions, such as from helix to 
random coil, to be followed by a technique 
called optical rotatory dispersion. By tracing 


structural transitions induced by changes in 
temperature, he was able to estimate the frac- 
tion of each protein that was helical. These 
results laid the foundations for modern stud- 
ies of protein folding. 

He then, in the 1950s, turned to nucleic 
acids, a step made possible by the availability 
of synthetic polynucleotides. Out of this 
work came the discovery that DNA structure 
could be altered using heat, that the melting 
temperature of DNA from various bacteria 
differed according to the fraction of G and 
C nucleotides in it, and that RNA contains 
regions of secondary structure. These 
observations are the basis of much modern 
biotechnology, including DNA sequencing 
and the polymerase chain reaction. 

Doty’s strength lay in identifying 
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important scientific issues with conceptual 
and practical implications that could be 
resolved by devising simple model systems. 
He was adept at finding the right person 
for a particular job, hiring Jacques Fresco 
for the polynucleotide work, for example, 
and Julius Marmur for studies using DNA. 
He was discriminating in how he worked 
with his students, balancing a laissez-faire 
attitude with specific guidance accord- 
ing to each student’s needs. As PhD 
students, we both benefited from this 
style of mentorship. 


PUBLIC AFFAIRS 

His most important choice of associate 
wasa graduate student, Helga Boedtker, 
whose thesis on the structure of colla- 
gen in solution became a landmark. 
They married, and Helga became the 
mainstay of the laboratory for more 
than 40 years until her death in 2001. 
When Doty was absent from the lab 
during his trips abroad to promote arms 
control, a quiet word to Helga brought 
urgent matters to his attention. 

Very early in his career, Doty became 
active in science and public affairs, with 
a focus on US-Soviet relations. He 
chaired the Federation of American 
Scientists and the US Pugwash Com- 
mittee, worked on disarmament plans 
under President Dwight Eisenhower's 
science adviser James Killian, and was 
a member of President John F. Kenne- 
dy’s Science Advisory Committee. Doty 
co-chaired the Soviet-US disarmament 
committee (now the Committee on 
International Security and Arms Con- 
trol of the National Academy of Sciences), 
helped to arrange the first visits to Moscow 
of US secretaries of state Henry Kissinger 
and Cyrus Vance and played a key part in 
developing the basis for the US-Soviet 
Anti-Ballistic Missile Treaty. 

As noted by molecular biologist Matthew 
Meselson, Doty “surely helped to avoid 
catastrophe”. = 
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materials opens the door to the hierarchical 
assembly of functional materials that could 
react sensitively to changes in ambient con- 
ditions or to mechanical stimuli. One way to 
extend the functionality of their system could 
be to make colloidal particles that are not only 
chiral, but also have surfaces patterned with 
microscopic domains that attract or repel each 
other. Such patterned colloidal particles have 
recently been shown to self-assemble into useful 
functional structures such as ‘kagome lattices’. 

Chiral interactions between particles occur 
in all kinds of materials, from liquid crys- 
tals** to cytoskeletal filaments in cells (for 
which the hierarchical assembly of bundle- 
like structures is dependent on, and sensitive 
to, the helical twist of the filaments”*). Most 
of the chiral materials studied so far are 
passive or in thermal equilibrium. This means 
that their structural assembly is governed 
only by diffusion and by local interactions 


The path to 


between the constituent particles. But most 
naturally occurring materials are far from 
passive. Instead, they constantly consume 
energy so that their particles self-organize into 
higher-order structures — the cytoskeletons 
of cells are prime examples of this”””. 

The next step, therefore, is to apply the 
principles identified by Gibaud and colleagues’ 
to active systems. Imagine replacing the 
viruses used in this study with self-propelling 
chiral bacteria — would they still pack together 
in a two-dimensional membrane, and, if so, 
would the membrane move, or rotate, across 
mesoscopic or macroscopic distances? Could 
actively beating ribbons form, or would the 
noise stemming from active movement of the 
bacteria prevent the formation of such struc- 
tures? All we can say for certain is that chiral 
interactions will add a new twist not only to 
the self-assembly of colloids, but also to the 
self-organization of active materials. m 


retinoblastoma 


Genomic analyses of tumours of the childhood cancer retinoblastoma reveal a 
low mutation rate, challenging the view that genomic instability is crucial for its 
progression. The work also identifies a new therapeutic target. SEE ARTICLE 2.329 


JULIEN SAGE & MICHAEL L. CLEARY 


etinoblastoma is a rare tumour that 
Rites retinal cells in the eyes of 
children. Analyses of familial and 
sporadic cases of this cancer, backed by stud- 
ies in genetically engineered mice, have shown 
that loss of function of the tumour-suppressor 
protein RB1 (also known as RB) is required for 
the development of most, if not all, tumours 
of this type. However, it is not clear how RB1- 
deficient retinal cells progress to malignant 
tumour cells’. In addition, emerging evidence 
that loss of RB1 function can induce genomic 
instability” has raised the tantalizing possi- 
bility that RB1-deficient retinal cells might 
be predisposed to accumulating many addi- 
tional mutations, further complicating the 
identification of mutations that contribute to 
the development and maintenance of retino- 
blastoma. On page 329 of this issue, however, 
Zhang et al.’ demonstrate that retinoblastoma 
genomes have very few recurrent mutations in 
genes other than RB1. Instead, the expression 
of cancer-related genes is affected by epigenetic 
modifications on chromosomes, which do not 
affect DNA sequence but are inherited after 
cell division. 
To identify mutations that could cooperate 


with loss of RB1 function in tumour develop- 
ment, Zhang and colleagues’ sequenced and 
compared the genomes of normal tissue 
and retinoblastoma tumours from four 
patients. The researchers found that RB1 was 
the only known cancer-related gene consist- 
ently mutated, and that the retinoblastomas 
had 15-fold fewer total mutations than other 
types of solid tumour whose genomes have 
been sequenced’. 

Next, the authors searched for epigenetic 
alterations and for abnormal gene expression 
in retinoblastoma cells. They identified the 
gene that encodes the protein kinase enzyme 
SYK asa potential oncogene whose expression 
is consistently higher in retinoblastoma cells 
than in normal immature retinal cells. More- 
over, the activity of SYK was essential for the 
growth of retinoblastoma cells. The authors 
also show that certain small molecules that 
selectively inhibit SYK activity induce the 
death of retinoblastoma cells in a mouse model 
of the disease. 

These findings’ indicate that SYK may be 
a promising target for treating patients with 
retinoblastoma. SYK was not identified in 
previous searches for genes with a role in this 
cancer because it is not mutated or structur- 
ally rearranged in retinoblastoma. Therefore, 
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Zhang and colleagues’ study emphasizes the 
importance of high-throughput approaches 
that integrate genome sequencing with 
gene-expression analysis and epigenomics to 
identify cancer genes. 

Low mutation frequency has also been 
observed’ in medulloblastoma (a type of brain 
tumour that affects children), suggesting that 
it could be a general feature of childhood can- 
cers. A possible explanation for this difference 
between the cancers of children and adults is 
that childhood tumours arise in cells that are 
naturally undergoing rapid developmental 
growth, with fewer brakes on their proliferation 
than cells in adults. An alternative explanation 
is that, in children, these cells are negotiating 
crucial developmental checkpoints that are sus- 
ceptible to corruption, leading to incomplete 
or abnormal maturation. In both cases, only a 
few mutations would be needed to trigger the 
cellular changes associated with cancer. 

Furthermore, epigenetic changes in children 
and excessive mutations in adults may have 
similar roles in cancer development. Another 
childhood cancer, Wilms tumour, also has 
a relatively stable genome and displays an 
increased variation in DNA-methylation 
patterns compared with normal cells®. RB1- 
deficient retinal cells may be particularly 
susceptible to this tumour-formation mecha- 
nism, because RB1 interacts with the machin- 
ery that controls the epigenetic status of cells, 
including enzymes that remodel chromatin 
(DNA-protein complexes) and other enzymes 
that add methyl groups to DNA’. Thus, child- 
hood cancers such as retinoblastoma may 
carry epigenetic abnormalities that change 
gene expression and are stably propagated 
through subsequent cell divisions, helping to 
maintain tumour-associated features. 

If this model is correct, it is possible that 
RB1-deficient tumours in adults — such 
as small-cell lung cancer and some breast, 
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prostate or blood cancers — also display 
epigenetic changes that are crucial for tumour 
development and maintenance. Future experi- 
ments, including integrated genomic analyses, 
may provide the answer. 

The underlying mechanism of alterations 
in gene expression by epigenetic means in 
retinoblastoma is still unclear. But Zhang and 
co-workers’ observation’ that 13% of these 
tumours show recurrent mutations in the 
BCOR gene offers a possible explanation. The 
BCOR protein is highly expressed in the fetal 
retina and is essential for eye development, 
as evidenced by a congenital eye disorder, syn- 
dromic microphthalmia, which results from a 
heritable mutation in BCOR’. BCOR associates 
with proteins that repress gene expression 
epigenetically*. This raises the possibility 
that loss of BCOR function, due to acquired 


EVOLUTIONARY BIOLOGY 


mutations in its gene, may contribute to an 
altered epigenetic landscape in RB1-deficient 
retinal cells. Consequently, inappropriate 
expression of crucial genes may impair cell 
maturation and so facilitate the progression of 
retinoblastoma. It will be of interest to deter- 
mine whether BCOR is part of the repression 
machinery that silences SYK expression in the 
normal retina, and whether acquired BCOR 
mutations provide at least one route to altered 
expression of SYK in retinoblastoma. 

Nevertheless, the current work — using a 
comprehensive, integrated genomics approach 
— is notable not just for demonstrating that 
epigenetic alterations have a predominant role 
in the progression of retinoblastoma. The new 
possibilities it raises for therapies in this child- 
hood malignancy, and possibly in other types 
of tumour, are equally noteworthy. = 


A ratchet for protein 


complexity 


Molecular machines containing related protein subunits are common in cells. 
Reconstruction of ancient proteins suggests that this type of complexity can 
evolve in the absence of any initial selective advantage. SEE LETTER P.360 


W. FORD DOOLITTLE 


rganisms and cells are bewilderingly 
complicated, and the molecular 
machines that perform many basic 
cellular functions are often giant, multi- 
subunit, multifunctional protein complexes 
with tangled evolutionary histories. It is 
generally assumed that such complexes 
arose by the stepwise accretion of indi- 
vidual proteins, each addition represent- 
ing a selective advantage by adding to or 
refining the machine’s performance. But 
on page 360 of this issue, Thornton and 
colleagues’ argue against this standard 
explanation in one particular instance — 
that of a ring-shaped protein complex in 
fungi. The authors show how evolutionary 
processes entailing loss of function rather than 
gain might initially drive a system towards 
complexity, independently of selection*. 
Many cellular molecular machines con- 
tain several proteins that self-assemble into a 
multi-subunit ring. In simple cases, rings are 
homo-oligomeric; that is, all the subunits 
are identical and thus probably the products 
of a single gene. In more complex (hetero- 
oligomeric) examples, the protein molecules 
in the ring are different, but may be related. 


“This article and the paper? under discussion were 
published online on 8 January 2012. 


Often, hetero-oligomeric rings seem to have 
arisen from homo-oligomeric complexes 
after a gene encoding a single subunit became 
duplicated, producing two or more gene 
copies (called paralogues), with each copy 
subsequently evolving to encode a slightly 
different protein. 
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The question that Thornton and col- 
leagues address is, why does this happen? 
Do the structurally distinguishable subunits 
and/or their specific pattern of assembly confer 
improved or additional functions on the pro- 
tein complex, with selection for their enhanced 
performance being the evolutionary driving 
force? Alternatively, might neutral processes 
be responsible, at least initially? 

To understand how the latter possibility 
might come about, imagine that a gene encod- 
ing, for example, eight subunits of a homo- 
oligomeric ring undergoes duplication (Fig. 1). 
The two paralogous gene copies that this dupli- 
cation produces — and the protein subunits 
they encode (call them A and B) — will of 
necessity begin to diverge through accumu- 
lation of neutral mutations. These mutations 
might not at first affect any of the subunits’ 
functions, so that functional octameric rings 
could continue to form bya random sampling 


Figure 1 | Evolution of complexity in a protein ring. a, Homo-oligomeric protein complexes contain 
identical subunits (A) that are typically encoded by a single gene. b, If the gene is duplicated, the two gene 
copies can diverge through the accumulation of neutral mutations, generating structurally distinguishable 
but functionally unaltered subunits A and B, which can form functional hetero-oligomeric rings by 
random mixture. c, If additional mutations prevent subunits from binding to others of their own type, 
functional rings could still be formed by alternating subunit types. As further mutations accumulate, the 
probability of returning to the initial homo-oligomeric situation becomes very small. Thus, the subunit 
composition of a protein association may become complex in the absence of initial selective advantages. 
Thornton and colleagues’ provide experimental evidence suggesting that this type of process has occurred 
in the evolution of a six-membered protein ring that forms part of the vacuolar H’-ATPase enzymes in fungi. 
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50 Years Ago 


At the International Geological 
Congress in Copenhagen in 1960, 
and at the Russian Trade Exhibition 
in London last year, one of the 
outstanding scientific exhibits was 

a hand-coloured tectonic map 

of Europe compiled by Russian 
draughtsmen from copy submitted 
by the various national geological 
surveys. The map, on a scale of 
1:2,500,000, has been sponsored by 
the Sub-Commission on the Tectonic 
Map of the World of the International 
Geological Congress ... Western 
geologists who wish to make sure of 
receiving a copy should place an order, 
in advance of publication, with their 
national agents for Russian books. 
From Nature 20 January 1962 


100 Years Ago 


The possibility of the discovery of a 
remedy for cancer has been advanced 
a stage by the preparation by Prof. 
Wassermann, of Berlin, of a substance 
which possesses a curative action 
experimentally on cancer of mice. 
Prof. Wassermann reasoned that since 
the cancer-cells are growing rapidly, 
their oxygen requirements would 
be different from, and greater than, 
those of the cells of the body generally. 
He sought for some substance which 
might interfere with the oxygen supply 
to the cancer-cells, and finally adopted 
selenium as a means to do this. The 
next problem was to convey selenium 
to the cancer-cells by means of the 
blood stream, and ... a compound of 
selenium with an anilin dye eosin was 
found to fulfil this condition ... After 
two or three injections of the substance 
into a mouse the subject of cancerous 
tumours, the tumours are found to 
have softened, and after six to eight 
doses they become cystic, diminish 
in size, and finally disappear, and no 
recurrence takes place. The eosin-— 
selenium compound is, however, 
poisonous, and a certain number of 
mice succumb under the treatment. 
From Nature 18 January 1912 
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i galaxy’s effect 


Figure 1 | Seeking satellites. Vegetti et al." have discovered a satellite galaxy around a larger galaxy, 
located at a cosmological distance, by examining how both galaxies act as gravitational lenses and distort 
light from a background source as the light travels to Earth. a, Were there no lenses between an observer 
on Earth and a source — here illustrated asa kangaroo” — the observer would, if instrumentation 
allowed, see one image of the source. b, If there is a massive lens between the observer and the source, 

the source is magnified and imaged more than once. In this case, as in the authors’ study, the source 

is deformed into a ring that consists mainly of two distorted images of the source. The two images are 
caused by the larger galaxy, whereas the dent and blob features (labelled) are caused by the satellite galaxy, 
the mass of which has been exaggerated to demonstrate the effect more clearly. 


In the present study, the gravitational lens 
system, called JVAS B1938 + 666, consists of a 
very distant galaxy that bends light from an 
even more distant background galaxy. Light 
from the background galaxy is deformed into 
a ring around the lens galaxy — a prime exam- 
ple of what is known as an Einstein ring. In this 
case, the ring is formed mainly by two lensed 
images of the background galaxy’. 

Vegetti and her team‘ obtained a near- 
infrared image of JVAS B1938 + 666 using 
the 10-metre Keck telescope in Hawaii. They 
used an optics system that corrects for the blur- 
ring effect of Earth’s atmosphere to improve 
the image quality. With such data, they could 
neatly determine the mass distribution of the 
lens galaxy, as well as the shape and brightness 
of the background galaxy. 

And here comes the connection to satel- 
lite galaxies. The sophisticated numerical 
technique’ used by the authors allowed them 
not only to derive a model of the lens galaxy’s 
mass, but also to map any excess lens mass that 
could not be accounted for by the galaxy. They 
found an excess mass near the Einstein ring 
that they attributed to the presence ofa satellite 
galaxy (Fig. 1). This method has been dubbed 
gravitational imaging®. Vegetti et al. also used 
an analytical model to test the detected excess 
mass and found that a satellite galaxy is indeed 
required to explain the data. 

This satellite galaxy is exciting because it 
was detected in the excess-mass map despite 
its low mass. Assuming that the object is in the 
neighbourhood of the main lens galaxy, it has 
amass of some 113 million solar masses within 
a radius of about 600 parsecs (2,000 light years) 
— values that put it firmly in the realm of sat- 
ellite galaxies. The mass is tenfold lower than 
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those of two other satellite galaxies that have 
been detected”* recently using gravitational 
lenses (with the possible exception of another 
low-mass satellite’, although the detection of 
this object is not yet confirmed). 

Vegetti and colleagues* went on to deter- 
mine the mass function of satellite galaxies 
— that is, the expected number of satellites 
for a given mass — at distances beyond the 
local Universe by combining data from the 
new object and another satellite galaxy’. They 
found that the resulting mass function is con- 
sistent with current galaxy-formation theory. 
But owing to the small number of known 
objects, uncertainties in the deduced function 
are quite large. 

A natural question to ask is whether the 
satellite galaxy can be observed directly rather 
than by its gravitational effect on the shape ofa 
background object. With current instrumenta- 
tion, the answer is no. The object is simply too 
distant to be imaged directly. But the message 
here’ is that it is possible to spot these elusive 
objects around distant lens galaxies without 
knowing where to look for them. 

The satellite should be considered in the 
context of about 50 satellite galaxies that are 
known to exist in the Local Group and that 
have been found by imaging their starlight*”. 
The number of these nearby satellites has 
greatly increased over the past decade, and 
often only a hundred (or fewer) stars are 
detected in them. From measurements of the 
velocities of their member stars, the satel- 
lites have been shown to be dominated by an 
unseen mass component called dark matter. 
Dark matter is the reason that satellites such as 
the newly discovered one are massive enough 
to be found using gravitational lenses. 
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Uhlmann go to some lengths to exclude the 
involvement of Cdcl4 localization within 
the nucleus, the dependence of dephosphoryla- 
tion on specific events associated with mitotic 
exit, and the specificity that results from the 
activation of CDK by its different activating 
partners (cyclins). 

Uhlmann and colleagues’ have previously 
suggested that a CDK-based oscillator acting 
as a counterbalance to protein phosphatases 
might be responsible for order not just dur- 
ing mitotic exit, but throughout cell division. 
However, there is substantial biochemical 
and genetic evidence for interdependent 
relationships that impose order throughout 
the cell cycle®. For example, a recent study® 
demonstrated that the sequential action of 
specific cyclin-CDK complexes is required to 
eliminate Sicl — a potent CDK inhibitor that 
restricts passage from G1 into S phase. The 
researchers’ report that the order of action 
of those cyclin-CDK complexes is due not 
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to differences in their intrinsic activity, but to 
differences in their specificity for particular 
phosphorylation sites on Sicl. That is, phos- 
phorylation of a set of sites by one cyclin-CDK 
complex depends on prior phosphorylation of 
Sicl by a different cyclin-CDK complex. 

It seems, therefore, that multiple mecha- 
nisms are involved in imposing order on 
cell-cycle events. Nevertheless, the find- 
ing’ that a single cyclin-CDK complex can 
orchestrate an effective cell-division cycle 
suggests that, at least in some systems and 
under certain physiological conditions, the 
specificity of different cyclin-CDK complexes 
is dispensable. It remains to be established 
whether phosphatase-based mechanisms 
similar to those described by Bouchoux and 
Uhlmann’ are essential for ordering cell-cycle 
events beyond mitotic exit, and whether they 
are even sufficient to replace cyclin-CDK 
specificity under particular conditions. It 
is largely because the other phosphatases 


Stabilizers cause 


instability 


Any chemist will tell you that amide compounds are renowned for their stability, 
yet a family of amides has been discovered that is surprisingly unstable. The 
cause of this phenomenon is even more surprising. 


JONATHAN CLAYDEN 


ome molecules are stable under all kinds 
of conditions; others are so unstable 
that they'll react with almost anything. 
Amides, the class of compounds that includes 
proteins and nylon, are usually stable mol- 
ecules. Strong acid or alkali is needed to 
break them down — nylon fabrics are highly 
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Figure 1 | Amide reactivity. a, Most amides are unreactive because the 
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resistant to chemical damage, and dietary pro- 
teins can be digested only by the acidity of the 
stomach coupled with a battery of enzymes. 
Writing in Angewandte Chemie, Hutchby 
et al.’ report a group of amides that turns 
this typical character on its head. These com- 
pounds share structural characteristics with 
the least reactive amides of all, yet a remark- 
able mechanistic quirk means they exhibit 
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that provide the counterpoint to CDK are 
poorly understood that the choreography 
of this intricate dance remains incompletely 
described. m 
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reactivity that is usually associated with unstable 
reagents. 

In most cases, the reactivity of a molecule 
can be predicted with a fair degree of cer- 
tainty by considering the combined effects of 
bond strength, polarity and steric hindrance 
(the physical volume occupied by parts of 
a molecule). For example, a carbon-iodine 
bond is more reactive than a carbon-fluorine 
bond because it is much weaker, whereas an 
oxygen-hydrogen bond is more reactive than 
a carbon—-hydrogen bond because it is more 
polarized. These effects mean that the most 
interesting properties of organic molecules 
tend to be concentrated in reactivity hotspots 
known as functional groups, each of which 
contains one or more atoms that either attract 
or repel electrons more than does carbon 
or hydrogen. The concept of the functional 
group, and the fact that functional groups have 
a typical reactivity profile, is central to the abil- 
ity of chemists to assess, predict and design the 
properties of molecules. 


Ketene Amine 


molecule. This repulsion facilitates rotation of the bond between the nitrogen 


nitrogen atom in the compounds donates a pair of electrons (shown as dots) 
to the carbonyl (C=O) group, and because the functional groups (red spheres) 
on the nitrogen atom may block the approach of attacking reagents. The curly 
arrow indicates electron donation. b, Hutchby et al.' report that very hindered 
amides, which have outsize groups on the nitrogen atom, buck the trend, 
because the groups are repelled (blue curved lines) by other groups in the 
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and the carbon of the carbonyl group, and so of the groups on the nitrogen 
atom, and prevents donation of the nitrogen’s electron pair to the carbonyl 
group. Instead, a hydrogen atom (green) on the carbon adjacent to the carbonyl 
group is transferred as a H” ion to the nitrogen, forming a zwitterion (a molecule 
that contains both a positive and a negative charge), which falls apart to produce 
a ketene and an amine. The black dot in the ketene represents a carbon atom. 
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One particular family of functional groups 
— those that contain a carbonyl group, C=O — 
is of particular importance because its reactions 
are central to most common methods of con- 
structing carbon-carbon bonds. The carbonyl 
group is electrophilic (it reacts with electron- 
rich species such as alcohols and amines), 
but just how electrophilic it is depends enor- 
mously on what else is attached to the group’s 
carbon atom. Attach an electron-withdrawing 
chlorine atom, and a highly reactive acid chlo- 
ride results; attach two chlorine atoms and you 
get phosgene, an even more reactive (and toxic) 
gas. Appending an electron-rich nitrogen atom, 
on the other hand, satisfies the carbonyl group's 
desire for electrons because the nitrogen shares 
its own electron pair with the group (Fig. 1a), 
a phenomenon known as delocalization. The 
resulting compound is an unreactive amide. 

Amides are unreactive not just because the 
nitrogen atom satisfies the electron deficiency 
of the carbonyl group, but also because the 
nitrogen atoms of amides can carry two sub- 
stituents, both of which can get in the way of 
incoming reagents (the phenomenon of steric 
hindrance; Fig. 1a). Hindered amides, in which 
the nitrogen atom carries two big groups, are 
usually among the least reactive carbonyl com- 
pounds known. This makes them valuable in 
organic chemistry — with the reactivity of the 
carbonyl group shut down, reactions can occur 
elsewhere without interference from com- 
peting side reactions at the carbonyl group. 
Hindered amides are used in this way in the 
preparation of reactive, synthetically useful 
organometallic compounds’, for example, 
and can be even less reactive towards electron- 
rich reagents than is the proverbially stable 
benzene ring**, 

The group of compounds now reported 
by Hutchby et al.’ overturns these expecta- 
tions. The authors prepared highly encum- 
bered amides in which the nitrogen atom 
carries two very large groups, and observed 
that, far from being the least reactive of their 
class, the compounds were more reactive than 
their less encumbered relatives. A possible 
explanation for this is an effect that takes over 
from steric hindrance when the groups car- 
ried by an amide’s nitrogen atom are abnor- 
mally large: the groups effectively switch 
off the carbonyl-deactivating effect of the 
nitrogen’s pair of electrons (Fig. 1b). 

Such ‘non-delocalized’ amides, in which the 
nitrogen’s electrons are forced into a position 
from which they cannot interact with the car- 
bonyl group, have been made before*®, and are 
indeed more reactive. But Hutchby et al. pro- 
pose’ that the ‘off switch’ is more subtle in their 
case. They found that these amides have struc- 
tural characteristics very similar to unhindered 
amides, but that the repulsion between the 
nitrogen’s large groups and other parts of the 
molecule makes it much more favourable for 
the groups to move away from their neigh- 
bouring atoms. This allows the bond between 


the carbonyl group and the nitrogen atom to 
rotate (Fig. 1b), in turn allowing the nitrogen 
atom to take back its previously delocal- 
ized, deactivating electrons. Those electrons 
therefore become available to grab hold ofa 
hydrogen ion (H") from the other side of the 
carbonyl group, setting in train a series of 
events — again accelerated by the size of the 
groups on the nitrogen — in which the mol- 
ecule is forced to fragment into a compound 
called a ketene, which is much more reactive 
than an amide. In the authors’ experiments, 
the ketene subsequently reacted with the sur- 
rounding solvent (methanol), forming ester 
compounds as the final products. 

Not all highly encumbered amides will 
behave as Hutchby and colleagues’ compounds 
do, only those that have a methylene group 
(CH,) on the other side of the carbonyl 
group from the nitrogen atom. And for the 
process to work really well, an electron- 
withdrawing functional group must be 
attached to the methylene group. Nevertheless, 
it is remarkable that the least reactive of car- 
bonyl compounds can so easily be bumped up 
the reactivity scale, and even more so that the 
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very change responsible — increased molecular 
bulk — usually decreases reactivity. This dis- 
covery exposes a chink in the armour of amides 
that could be used to circumvent many current 
and future challenges in synthetic chemistry. 
For example, amides could be used as a Trojan 
horse to incorporate hidden reactivity into a 
molecule, reactivity that is later revealed using 
Hutchby and colleagues’ chemistry when 
other modifications to the molecule have 
been made. m 


Jonathan Clayden is in the School of 
Chemistry, University of Manchester, 
Manchester M13 9PL, UK. 

e-mail: clayden@man.ac.uk 


1. Hutchby, M. et al. Angew. Chem. Int. Edn 51, 
548-551 (2012). 

2. Snieckus, V. Chem. Rev. 90, 879-933 (1990). 

3. Clayden, J., Foricher, Y. J. Y. & Lam, H. K. Chem. 
Commun. 2138-2139 (2002). 

4. Clayden, J., Foricher, Y. J. Y. & Lam, H. K. Eur. J. Org. 
Chem. 3558-3565 (2002). 

5. Clayden, J. & Moran, W. J. Angew. Chem. Int. Edn 45, 
7118-7120 (2006). 

6. Szostak, M. & Aubé, J. Org. Biomol. Chem. 9, 27-35 
(2011). 


Reprogramming 
clinical outcome 


On binding to its target hormone, the oestrogen-receptor protein modulates the 
expression of many genes. Changes in the receptor’s interaction with DNA have now 
been linked to clinical outcome in patients with breast cancer. SEE LETTER P.389 


GENEVIEVE DEBLOIS & VINCENT GIGUERE 


tumour growth depends on oestrogen. This 

hormone binds to and activates its nuclear 
receptor, which then binds to specific sites on 
the genome and modifies gene expression. 
Hormonal therapies that inhibit oestrogen 
synthesis or that block the oestrogen recep- 
tor with drugs such as tamoxifen can restrain 
the proliferation of cancer cells in oestrogen- 
dependent breast tumours. Consequently, the 
presence of this receptor and the expression of 
its target genes in primary tumours are gen- 
erally predictors of a good prognosis, as the 
tumour will be amenable to these treatments. 
But despite the sustained presence of the recep- 
tor, these tumours can eventually develop 
resistance to hormonal therapies and spread 
to distant sites, leading to a poor outcome. On 
page 389 of this issue, Ross-Innes et al. (Car- 
roll and colleagues)’ present evidence that 
reprogramming of the interaction of the oes- 
trogen receptor with DNA in breast tumours 


I: about two-thirds of breast cancers, 


significantly contributes to this change in 
clinical outcome. 

Functional genomics — genome-wide stud- 
ies of dynamic interactions between genes and 
proteins — has greatly enhanced our under- 
standing of the action of the oestrogen receptor 
(ERa) in breast cancer’. Such analyses have 
identified a total of some 80,000 ERa-binding 
sites in the genomes of breast-cancer cells. 
However, there is only a relatively mod- 
est overlap between the sets of binding sites 
described by different studies. This indicates 
that, with the exception of a small set of core 
binding sites, plasticity is an attribute of ERa 
recruitment to chromatin (the complex of 
DNA and proteins). So, could clinicians use the 
ERa-binding profile of a breast tumour to 
predict the cancer’s fate? 

To address this question, Ross-Innes et al. 
mapped the ERa binding sites in the genomes 
of tumour samples by sequencing DNA seg- 
ments to which ERa was bound. The results 
clearly establish that different ERa-binding 
profiles in primary breast tumours can be 
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Figure 1 | Reprogramming of oestrogen-receptor activity in breast cancer. Tumour cells that express 
the oestrogen receptor ERa are sensitive to hormone-based therapy, leading to a good therapeutic 
outcome. Carroll and colleagues’ show that ERa binding to DNA is dynamically regulated, and that 
different ERa-binding profiles are associated with different clinical outcomes. They propose 

that the transcription factor FOXA1 can mediate the reprogramming of ERa activity by facilitating 

the recruitment of ERa to different locations on DNA, leading to the development of resistance to 
hormonal therapy and a bad therapeutic outcome. The biological cues acting on FOXA1 to induce ERa 
reprogramming are unknown. Another transcription factor, AP-1, might also reprogram ERa activity in 
breast tumours. Signalling by growth factors has been linked to resistance to hormonal therapy’ and is 
known to activate AP-1. Ross-Innes et al.' observe a comparable presence of sites for AP-1 and FOXA1 
binding to DNA in ERa-bound DNA segments identified in tumours with poor outcome. 


linked to distinct clinical outcomes. Thus, 
profiling of ERa-binding sites might contrib- 
ute to a better molecular classification of breast 
tumours. 

In the current study, the authors pre- 
classified breast tumours from 15 patients 
into two groups — good and poor progno- 
sis — depending on the presence of certain 
proteins that are used as standard markers for 
responsiveness to hormonal therapy. On its 
own, this approach can be deceptive, because 
some tumours that express markers predict- 
ing high responsiveness show only a modest 
response to tamoxifen and therefore have a 
poor outcome. So, in their pre-classification, 
Ross-Innes et al.’ also took into consideration 
information from long-term follow-up of the 
patients. However, to establish precise bind- 
ing profiles that, by themselves, could be used 
to predict outcome without prior knowledge 
of other indicators, many more tumours from 
larger groups of diverse patients would have to 
be profiled. 

What mechanism underlies the differences 
among breast tumours in ERa recruitment to 
genomic sites? Previous functional genomic 
studies** have shown that the interaction of 
ERa with chromatin largely depends on the 
presence of the protein FOXA1. This belongs 
to a class of transcription factors that facili- 
tate protein binding to chromatin’. Recently, 
Carroll and colleagues® argued that FOXA1 is 
essential for ERa binding to chromatin, and 
for the receptor’s subsequent transcriptional 
activity, not only in the presence of oestrogen 


but also in cells exposed to tamoxifen. Whether 
FOXAI plays a part in the response of ERa- 
positive breast tumours to hormonal therapies 
in patients remained unclear, however. 

In the absence of co-profiles of FOXA1 and 
ERa binding in the same breast-tumour sam- 
ples, Ross-Innes et al.' addressed this question 
by searching within ERa-binding regions for 
known DNA sequence motifs recognized by 
different transcription factors. Unexpect- 
edly, they found that the FOXA1-binding 
motif occurs on ERa-bound DNA segments 
obtained from tumours associated with a poor 
outcome but not on segments from tumours 
associated with a good outcome. 

To validate this observation, the authors 
used cell-based models of breast cancer and 
again found preferential enrichment of the 
FOXA1-binding motif in tamoxifen-resistant 
cells and in cells that had been treated with a 
cocktail of agents promoting growth and pro- 
liferation. They conclude that in the tumours 
associated with poor outcome, FOXA1 could 
be triggering a rapid and dynamic reprogram- 
ming of ERa recruitment to chromatin at dif- 
ferent sites from those in tumours associated 
with a good outcome (Fig. 1). 

Whether FOXA1-dependent program- 
ming of ERa binding to DNA occurs during 
the course of the disease or is intrinsic to the 
primary tumour is yet to be determined. In 
either case, it would be of interest to identify 
the physiological cues and mechanisms dictat- 
ing FOXA1 recruitment to chromatin. Identi- 
fying FOXA1-binding sites in breast-tumour 
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samples would be an essential first step in 
assessing these issues. 

On the basis of their previous work’, 
Carroll and colleagues logically favour the 
hypothesis that FOXA1 has a key role in 
mediating reprogramming of ERa activ- 
ity. It is likely, however, that additional 
mechanisms are involved, including the par- 
ticipation of other proteins. In particular, 
AP-1, a transcription factor that stimulates 
cell growth and proliferation, might also con- 
tribute to ERa reprogramming (Fig. 1). Indeed, 
the authors’ find an enrichment of the binding 
motif for AP-1 in ERa-bound DNA segments 
from tumours associated with poor outcome. 
This observation nicely parallels previous 
results’ showing enrichment of AP-1 motifs 
in ERa-bound segments upon treatment of 
breast cancer cells with epidermal growth 
factor, a condition that mimics the increase in 
growth-factor signalling in hormone-resistant 
tumours. The involvement of AP-1 in this 
process is also supported by its function in 
facilitating chromatin availability for the bind- 
ing of the glucocorticoid receptor — another 
nuclear hormone receptor — to chromatin’. 

Over the past few years, functional genom- 
ics has enhanced our knowledge of hormone 
action. Ross-Innes and co-workers’ technical 
prowess' should motivate similar studies of 
other nuclear receptors and their interacting 
protein partners involved in the initiation and 
progression of all subtypes of breast cancer”. 
Profiling of transcription factors in cancer- 
cell lines has proved extremely useful in 
deciphering the mechanisms underlying 
hormone-driven gene expression. But using 
primary tumours and metastatic tissues, as 
Ross-Innes et al. have done, will be required 
to reach relevant conclusions about the roles 
of each of these factors in cancer. m 
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and the response to infection, how understanding the 
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of inhibitors and helped to establish new principles for 
treating cancer, the potential therapeutic promise of 
patient-derived pluripotent stem cells, the forces that 
govern tumour evolution and the impact of mouse 
genetics on the study of bone physiology. 

Richard Flavell and colleagues provide a timely 
overview of the activation and function of different 
inflammasomes. These are protein complexes that sense 
microbes and signs of tissue damage by responding 
to exogenous and endogenous signals. They can 
be beneficial and detrimental in a diverse set of 
inflammatory and metabolic diseases. 

Christopher Lord and Alan Ashworth discuss how 
understanding the different response pathways to DNA 
damage has led to a multipronged approach to cancer 
therapy. Inherent mutations of the cancer cell and repair- 
enzyme inhibitors are used to synergistically target 
multiple repair pathways to enhance the cell killing effect. 

Daisy Robinton and George Daley highlight the recent 
progress in the derivation of induced pluripotent stem 
cells, outline the functional assessments of pluripotency 
and discuss whether induced pluripotent stem cells 
equal embryonic stem cells, and thus may be a suitable 
alternative for research and therapy. 

Mel Greaves and Carlo Maley consider our current 
knowledge of clonal evolution in cancer. Tumours and 
metastases evolve through the successive accumulation 
of genetic and epigenetic alterations and clonal 
selection within the adaptive environment. This leads to 
phenotypic and functional heterogeneity of cancer cells 
within a tumour. Although therapy can eliminate most 
cancer cells, it can drive the selection of resistant clones. 

Finally, Gérard Karsenty and Mathieu Ferron discuss 
how recent evidence has changed our view of the 
skeleton and revealed an intimate connection between 
bone and the regulation of metabolism. 
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Inflammasomes in health and disease 


Till Strowig'*, Jorge Henao-Mejia’*, Eran Elinav'* & Richard Flavell’? 


Inflammasomes are a group of protein complexes built around several proteins, including NLRP3, NLRC4, AIM2 and NLRP6. 
Recognition of a diverse range of microbial, stress and damage signals by inflammasomes results in direct activation of 
caspase-1, which subsequently induces secretion of potent pro-inflammatory cytokines and a form of cell death called 
pyroptosis. Inflammasome-mediated processes are important during microbial infections and also in regulating both 
metabolic processes and mucosal immune responses. We review the functions of the different inflammasome complexes 
and discuss how aberrations in them are implicated in the pathogenesis of human diseases. 


limit harm to the body’. However, dysregulated and chronic inflam- 
mation may result in secondary damage and immune pathology to 
the host. Inflammation is initiated on the sensing of signs of acute dam- 
age or disturbances of the steady state. Several recognition systems have 
evolved together to distinguish between homeostasis and threats to the 
host. Some of these receptors recognize distinct conserved pathogen- 
associated molecular patterns (PAMPs) that are predominantly found 
in microbes and hence allow the exquisite sensing of pathogens in tis- 
sues that are normally devoid of these structures”. Multiple additional 
receptors recognize host-derived signals, called damage-associated 
molecular patterns (DAMPs), that are released as a result of pertur- 
bations of tissue homeostasis caused by microbial or non-microbial 
insults, allowing a general sensing of stressed tissue’. 
Inflammasomes are a group of protein complexes that recognize 
a diverse set of inflammation-inducing stimuli that include PAMPs 
and DAMPs and that control the production of important pro-inflam- 
matory cytokines such as interleukin-16 (IL-1) and IL-18 (refs 4, 5). 
Furthermore, they have been found to regulate other important aspects 
of inflammation and tissue repair such as pyroptosis, a form of cell 
death. Recent studies have characterized distinct molecular activation 
mechanisms for several sensor proteins and have identified a multitude 
of ligands of both endogenous and exogenous origins. Moreover, the 
diverse functions of these complexes in antimicrobial responses, as well 
as in multifaceted diseases such as metabolic syndrome and inflam- 
matory bowel disease (IBD), have started to be revealed. Importantly, 
mutations in components of inflammasome complexes have been 
associated with a propensity for the development of several immune- 
mediated diseases in humans. We review several of these conditions, 
discuss the different models that have been proposed for inflamma- 
some involvement in normal and aberrant immune response, and 
highlight the challenges and future directions for this field. 


I nflammation is an acute response to infection and tissue damage to 


Inflammasomes are intracellular innate immune sensors 


Inflammasomes control the activity of caspase-1 

IL-1 is one of the quintessential pro-inflammatory cytokines 
that broadly affects inflammatory processes®. Tight control of its 
production is therefore required at the transcriptional and post- 
translational levels. IL-1 is synthesized as a pro-protein without a 
typical signal sequence that would allow its secretion, and instead 
its activation and cellular release are controlled by the cysteine 


protease caspase-1 (ref. 7). Similarly, caspase-1 is responsible for 
the processing and secretion of IL-18, as well as the secretion of 
other proteins such as IL-1a and fibroblast growth factor-2 through 
an unconventional protein secretion pathway*. Moreover, caspase-1 
is required for pyroptosis, a form of cell death frequently observed 
during microbial infections that combines characteristics of apopto- 
sis (DNA fragmentation) and necrosis (inflammation and cytokine 
release)’. Like other caspases, caspase-1 is synthesized as an inac- 
tive zymogen (pro-caspase-1) and becomes proteolytically active 
only after controlled dimerization in inflammasomes that are built 
around one of several different molecules**”* '* (Fig. 1). Whereas 
the leucine-rich repeat (LRR) domain is thought to be involved in 
autoinhibition that is disabled on direct or indirect sensing of the 
activating signal, the nucleotide-binding domain (NBD) is involved 
in the regulation of homo-oligomerization or hetero-oligomeriza- 
tion, which is required for inflammasome assembly. On receiving 
an activating signal, inflammasome sensors recruit pro-caspase- 1 
(which has a caspase activation and recruitment domain (CARD)) 
either directly through homotypic binding of CARD or indirectly 
through a pyrin domain (PYD) by means of the adaptor apoptosis- 
associated speck-like protein containing a CARD (ASC), which 
contains a PYD anda CARD. 


Diverse signals induce inflammasome formation 
Inflammasome assembly is unique in its induction by a variety of 
both exogenous and endogenous signals. The range of activation sig- 
nals sensed by each protein is distinct, but may include overlapping 
signals (Box 1). Whereas the AIM2 and NLRC4 inflammasomes are 
activated only by specific PAMPs, double-stranded DNA (dsDNA) 
and specific bacterial proteins, respectively'*’, NLRP3 is activated 
by a large variety of signals, including PAMPs, DAMPs and bacterial 
toxins’. The structural diversity of ligands activating the NLRP3 
inflammasome is in striking contrast to other innate pattern recogni- 
tion receptors such as Toll-like receptors, which usually recognize 
more confined structural motifs”. Several models have therefore 
been proposed to explain how all of these signals could activate the 
NLRP3 inflammasome (Box 1). These non-exclusive mechanisms 
include both direct and indirect signal recognition mediated by 
additional accessory proteins”’****, We discuss the nature of some 
of these NLRP3-activating signals in the context of the pathophysiol- 
ogy of several diseases, but they have been more extensively reviewed 
elsewhere*”. 


1Department of Immunobiology, Yale University, Connecticut 06520, USA. *Howard Hughes Medical Institute, Maryland 20815-6789, USA. *These authors contributed equally to this work. 
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Figure 1 | Domain organization of inflammasome proteins. The identified 
core components belong to two families, the NOD-like receptor (NLR) family 
and the PYHIN (pyrin and HIN200 (haematopoietic interferon-inducible 
nuclear antigens with 200 amino-acid repeats) domain-containing protein) 
family. The NLR family members include NLRP1, NLRP2 (not shown), 
NLRP3, NLRP6, NLRC4 and, potentially, NLRP12. They all contain a 
nucleotide-binding domain (NBD), carboxy-terminal leucine-rich repeat 
(LRR), and can contain either a PYD or a caspase activation and recruitment 
domain (CARD) or both. The PYHIN family members AIM2 and IFI16 are 
characterized by having, in addition to a PYD, a HIN200 domain, which is 
involved in ligand binding. BIR, baculoviral inhibition of apoptosis repeat 
domain; FIIND, domain with function to find. 


Regulation of inflammasome activity 

Inflammasome activity needs to be tightly regulated by the host to 
avoid the excess production of cytokines or overt cell death. Regula- 
tion occurs at transcriptional and post-transcriptional levels. First, the 
expression ofinflammasome sensors, in particular NLRP3, is relatively 
low in many cell types and requires a priming signal to be induced”. 
This signal is frequently referred to as signal I and can be provided 
by microbial ligands, cytokines or reactive oxygen species (ROS). In 
addition, alternative splicing of inflammasome components gener- 
ates protein variants with different activities. Indeed, splice variants of 
ASC have been identified with distinct abilities to serve as an inflam- 
masome adaptor, with one variant even having inhibitory activity”. 
Furthermore, the host expresses proteins that regulate inflammasome 
activity mainly by sequestering inflammasome components through 
homotypic interaction of CARDs or PYDs or through directly inhibiting 
caspase-1 function®*””. Similarly, subcellular location and trafficking of 
inflammasome components seems to be important for the regulation 
of inflammasome activity because ASC is found mainly in the nucleus 
in resting cells*. Another level of regulation may occur through the 
assembly of complexes with different components, suggested by the 
observation that cytokine processing downstream of NLRC4 activa- 
tion is ASC dependent, whereas pyroptosis is independent of ASC*". In 
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addition, it has been suggested that differential processing of caspase-1 
might contribute to this finding because catalytically active caspase-1 
that has been rendered unable to perform autocleavage is impaired in 
its ability to cleave cytokines, but it is still able to initiate pyroptosis 
on NLRC4-mediated recognition of Salmonella infection”. Regula- 
tion of inflammasome activity is also achieved through crosstalk with 
cellular stress-associated processes such as autophagy. Induction of 
autophagy leads to the degradation of cellular substrates such as pro- 
tein aggregates and organelles in autolysosomes for the recycling of 
metabolites. Strikingly, cells deficient in autophagy have a decreased 
threshold for inflammasome activation”. This has been suggested to 
be a result of the impaired clearance of defective mitochondria result- 
ing in elevated levels of ROS, hinting at an involvement of NLRP3 as 
sensor**”**, Another aspect of the regulation of inflammasome activity 
is its downregulation either through secreted factors or cell-cell inter- 
actions. Examples of these signals are type I interferons or interactions 
between CD4" T cells and macrophages or dendritic cells, respectively, 
leading to transcriptional and post-transcriptional downregulation 
of inflammasome activity”. All of these process probably cooperate 
in the temporal and spatial organization of inflammasome-mediated 
processes. In summary, the host has evolved distinct mechanisms to 
regulate inflammasome activation to prevent the dire consequences of 
inflammasome overactivation. 


Regulation of inflammation by inflammasomes 

Inflammasome-regulated processes depend on the simultaneous 
expression of the multiple inflammasome protein components in the 
same cell type of inflamed tissues. ASC and caspase-1 are found in 
many tissues and cell types, whereas the inflammasome sensors feature 
amore distinct expression pattern, suggesting tissue-specific mecha- 
nisms for sensing the microenvironment. In the following sections, we 
discuss how activation of inflammasomes in different lineages of cells 
regulates physiological reactions in the context of health and disease. 


Inflammasomes and the antimicrobial response 

In vivo, inflammasomes have been shown to participate in the 
antimicrobial innate immune response”. The most widely studied 
inflammasome in this regard is the NLRP3 inflammasome, shown to be 
involved in antibacterial, viral, fungal and parasitic immune responses. 
Despite the evidence linking the NLRP3 inflammasome to the immune 
response to infection, only in a minority of cases has inflammasome 
activation by direct recognition of the pathogen been documented; 
many studies have indicated inflammasome activation through induc- 
tion of signals related to cellular stress and damage’ (see Box 1). The 
influenza A virus is an example of an indirect viral NLRP3 inflam- 
masome activator. On infection, recognition of viral RNA by means 
of Toll-like receptor 7 (TLR7) induces transcription of the NLRP3 
inflammasome components”. Subsequently, the activity of the viral 
ion-channel protein M2 induces pH neutralization of the trans-Golgi 
network, leading to potassium efflux and ROS formation, which in turn 
induce NLRP3 inflammasome assembly. Recently, messenger RNA 
of microbial origin was shown to activate the NLRP3 inflammasome 
ina TRIF (TIR-domain-containing adaptor-inducing interferon-B)- 
dependent manner, providing a mechanism for NLRP3 recognition of 
infection and the resultant differentiation by the host between viable 
and non-viable bacteria, leading to the induction of a potent immune 
response only on exposure to live microbes”’. Mouse NLRP1 inflamma- 
some activation is central to the initiation of the antimicrobial response 
to Bacillus anthracis infection through caspase-1-induced pyroptosis of 
infected macrophages, which permits self-limitation of infection and 
initiation of an antimicrobial neutrophilic reaction”. Human NLRP1 
forms an ASC-dependent inflammasome, whereas mouse NLRP1b 
may activate caspase-1 in an ASC-independent manner”. NLRP1, like 
NOD2 and NLRP3, can sense muramyl dipeptide (MDP), a building 
block of the bacterial cell wall, suggesting possible cooperative roles 
of the different pattern recognition receptors in the sensing of and 
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response to bacteria. NLRC4 inflammasome activation is driven by 
type [II and type IV secretion systems (T3SS and T4SS) of bacteria 
such as Salmonella, Pseudomonas, Legionella and Yersinia, which allow 
the cytoplasmic entry of the NLRC4 ligand flagellin, leading to acti- 
vation of the NLRC4 pathway’. In addition, flagellin-independent 
activation of the NLRC4 inflammasome involves recognition of the 
T3SS rod protein PrgJ*’. Interestingly, NLRC4 activation during 
Legionella infection is dependent on another NLR protein, NAIP5, 
but only partial dependence on NAIP5 was demonstrated for NLRC4 
activation during Pseudomonas and Salmonella infection*”’. This 
apparent discrepancy was recently resolved by demonstrating that 
oligomerization of different NAIP family proteins with NLRC4 con- 
fers ligand specificity, so that a NAIP2-NLRC4 complex binds PrgJ 
and other structurally related rod proteins, whereas a NAIPS-NLRC4 
complex binds flagellin®’. Different pathogen receptors therefore 
assist NLRC4 in the recognition of microbial ligands, broadening the 
diversity of sensed structures. The recently discovered protein AIM2 
recognizes bacterial and viral dsDNA, resulting in an antimicrobial 
response to intracellular pathogens, such as Francisella tularensis, Lis- 
teria monocytogenes and some DNA viruses”. 

During microbial infection, distinct effector mechanisms of 


inflammasomes seem to be important for host defence. Although 
inflammasome-induced IL-1 and IL-18 are essential for the clear- 
ance of influenza virus and Shigella, respectively, pyroptosis has been 
suggested to be essential for caspase-1-mediated effects in vivo during 
infection by Salmonella enterica Typhimurium, Legionella pneumophila 
or Burkholderia thailandensis*’. Altogether, it seems that the coopera- 
tive activity of distinct inflammasomes is pivotal in the host's ability to 
raise a protective immune response. This cooperation is illustrated dur- 
ing infection by S. Typhimurium, in which deficiencies in either NLRP3 
or NLRC4 in vivo do not lead to increased bacterial infection, whereas 
mice deficient in both NLRC4 and NLRP3 have a similarly increased 
susceptibility to infection to that of caspase-1-deficient mice” (Fig. 2). 

To avoid inducing inflammasome activation, many pathogens 
encode inhibitors of inflammasome function™. Frequently, these 
inhibitors, such as NS1 of influenza virus or Crm1 of cowpox virus, act 
on central inflammasome components and directly inhibit caspase-1 
activity****. Other inhibitors, such as ORF63 of Kaposi’s sarcoma-asso- 
ciated herpesvirus, bind NLR sensors and prevent the formation 
of NLRP1 and NLRP3 inflammasomes”. In general, the inhibition 
of inflammasome activity by the immune-evasive mechanisms of 
many microbes highlights the importance of these complexes in 


BOX1 


Inflammasomes are assembled after sensing 
a structurally diverse repertoire of PAMPs and 
DAMPs. Several models have been proposed 
to explain how these signals are sensed, 
including models based on recognition of 
general cellular stress (Fig. a and b) or on 
direct and indirect recognition of activation 
signals (Fig. c-e). NLRP3 senses the reactive 
oxygen species (ROS), which is produced in 
the cell (potentially by mitochondria) directly 
or indirectly by activators of the NLRP3 
inflammasome. Increased amounts of ROS 
are sensed by a complex of thioredoxin and 
thioredoxin-interacting protein (TXNIP), 
leading to the dissociation of this complex. 
Subsequent binding of TXNIP to NLRP3 leads 
to the activation of NLRP3, the recruitment of 
ASC and pro-caspase-1, and formation of the 
active inflammasome complex (a). NLRP3 

is activated after lysosome destabilization. c 
The phagocytosis of specific crystalline and 
particulate structures can lead to lysosome 
destabilization and the release of lysosomal 
content, including proteases. These proteases 
could lead to proteolytic inactivation of a 
negative regulator or to proteolytic activation 
of a positive regulator of NLRP3, resulting 

in inflammasome assembly (b). NLRP1 

and AIM2 sense the ligand directly. The 
direct binding of specific ligands (muramyl 
dipeptide (MDP) and double-stranded 

DNA (dsDNA)) can lead to conformational 
changes in NLRP1 and AIM2, resulting in 
inflammasome activation. Inflammasome 
formation in NLRP1 is independent of ASC 
(c, d). NAIP proteins sense bacterial proteins 
resulting in the recruitment of NLRC4 and 
assembly of the NLRC4 inflammasome (e). 
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antimicrobial immunity. Over the next few years, it will be important 
to further characterize how pathogens are sensed by inflammasomes 
and how inflammasomes interact with additional innate immune 
pathways to regulate antimicrobial immune responses in vivo. 


Inflammasome activation by particulate compounds 

In addition to infectious agents, there is evidence in mice and in humans 
that aberrant inflammasome activation by non-infectious agents may be 
linked to the pathogenesis of diseases characterized by sterile inflamma- 
tion. For instance, crystal deposition diseases are a group of disorders 
in which inflammatory damage is elicited by exposure to exogenous or 
endogenous crystalline molecules. Examples of exogenous activators 
are silica and asbestos, whose endocytosis by pulmonary macrophages 
results in NLRP3 inflammasome activation involving ROS and lyso- 
some destabilization, leading in turn to silicosis and asbestosis, respec- 
tively**°”. Similarly, aberrant formation of crystals from endogenous 
molecules such as monosodium urate (MSU) and calcium phosphate 
may lead to NLRP3 inflammasome activation in macrophages. Indeed, 
particular accumulation of MSU had long been noted during gout, but 
only recently has it been demonstrated that NLRP3-deficient mice fea- 
ture defective MSU-induced neutrophil infiltration and inflammation” 
(Fig. 2). On the basis of these findings, clinical trials have been started 
to block IL-1 in gout; promising early results suggest that aberrant 
inflammasome activation drives inflammation in human crystal depo- 
sition disease. Furthermore, in patients with osteoarthritis, a common 
degenerative articular disorder, uric acid levels in synovial fluids are 
positively correlated with both IL-1f and IL-18 levels, as well as disease 
severity®®. Calcium phosphate crystals were recently shown to activate 
NLRP3 (ref. 59). Hydroxyapatite crystals, a component of bone, are 
frequently found in osteoarthritis synovial fluid, activate IL-1 produc- 
tion by means of the NLRP3 inflammasome, and mediate inflamma- 
tion and joint disease”. Future studies and clinical trials will be needed 
to explore whether targeting inflammasomes and their substrates are 
effective treatment options for these diseases. 

An interesting example of inflammation that can be induced by 
an exogenous crystalline compound is the activation of the NLRP3 
inflammasome by alum, an aluminium salt and the most commonly 
used adjuvant in human vaccines. Exposure of macrophages to alum 
in vitro leads to the NLRP3-dependent activation of caspase-1, and 
several studies have demonstrated that NLRP3 inflammasome-defi- 
cient mice have defects in alum-induced adaptive immune responses 
in vivo” *, Several other studies have debated the role of the NLRP3 
inflammasome in the adjuvant effect of alum, some suggesting that vac- 
cination-induced adaptive immune responses may be ASC dependent 
but independent of caspase-1 (and hence independent of inflammas- 
omes)***. A recent study suggested an alternative NLRP3-independ- 
ent mechanism for alum’s adjuvant effects, namely attachment to the 
plasma membrane of dendritic cells, leading to abortive phagocytosis 
and imprinting dendritic cells to induce a humoral immune response”. 
The precise mechanisms related to vaccination adjuvant effects and 
the possible contribution by inflammasomes remain to be clarified in 
future studies. 


Inflammasomes and metabolic syndrome 

The incidence of obesity worldwide has increased markedly in 
recent decades. Obesity is associated with multiorgan (namely 
pancreatic, adipose, hepatic, cardiac and muscle tissue) chronic 
metabolic and inflammatory alterations that together are termed 
‘metabolic syndrome. These alterations include impaired insulin 
sensitivity, pancreatic B-cell dysfunction, non-alcoholic fatty liver 
disease and atherosclerosis. Unequivocal experimental and clini- 
cal evidence causally link IL-1B and IL-18 to the development of 
these metabolic pathologies and their complications”. However, 
the underlying molecular mechanisms that result in tissue-specific 
inflammasome activation in the context of obesity have only just 
begun to be explained. 
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Figure 2 | Inflammasome activity regulates inflammation during 
microbial infection and autoinflammatory diseases. a, S. typhimurium 
enters the host by crossing through the intestinal epithelial barrier. M cells, 

a specialized epithelial cell type layering Peyer's patches, are particularly 
involved in the transcytosis of Salmonella and the infection of macrophages 
in Peyer’s patches. The inflammasome and caspase-1 are involved in several 
cell types and at several steps of the infection. Injection of the bacterial 
effector protein SopE into epithelial cells induces the activation of caspase-1 
independently of NLRP3 and NLRC4 through a process involving the GTPase 
Racl. The resultant mucosal inflammation is dependent on IL-1 and IL-18 
produced by non-haematopoietic cells. On infection of macrophages, the 
bacterial proteins flagellin and PrgJ (part of the T3SS) are sensed by means 

of NLRC4. This results in the activation of caspase-1, leading to both IL-1B/ 
IL-18 processing and pyroptosis, which limit systemic infection. NLRP3 
contributes to these processes by recognition of an unknown signal. b, 
Phagocytosis of monosodium urate (MSU) crystals by macrophages induces 
the NLRP3-dependent caspase-1 activation and release of IL-1, which 
stimulates non-haematopoietic cells to produce IL-6 and chemokines (CXCL1 
and CXCL8), attracting neutrophils. Activated neutrophils then cause tissue 
damage. Therapeutic blockade of IL-16 in humans ameliorates inflammatory 
bouts in gout. 


Inflammasome activation regulates insulin signalling 

NLRP3 inflammasome components and caspase-1 activation are 
increased in the adipose tissue and liver of obese mice and humans; 
moreover, their level of expression is directly correlated with the 
severity of type 2 diabetes mellitus (T2DM) in obese individuals”. 
During nutritional surplus, in addition to adipocyte hypertrophy 
owing to increased lipid storage, adipose tissue is infiltrated by clas- 
sically activated, M1, macrophages that secrete pro-inflammatory 
cytokines’’. NLRP3, ASC and caspase-1 are preferentially expressed 
in adipose-tissue-infiltrating macrophages, in which the saturated fatty 
acid palmitate and lipotoxic ceramides trigger NLRP3 inflammasome 
activation through a mechanism that involves defective autophagy 
and the accumulation of mitochondrial ROS”” (Fig. 3). Enhanced 


19 JANUARY 2012 | VOL 481 | NATURE | 281 


© 2012 Macmillan Publishers Limited. All rights reserved 


REVIEW 


Adipose tissue 


Adipose tissue 
macrophage 


Palmitate, 
ceramides J Adipocyte 
maturation 
Oxidation 
Khe ——————- 
I-18 T Insulin 
resistance ; 
Adipocyte 
T Insulin 
resistance Liver 
Hepatocyte , 


Figure 3 | The role of inflammasomes in metabolic syndrome. During 
obesity, the NLRP3 inflammasome is activated by obesity-associated DAMPs 
in multiple tissues and cell types; the resultant pro-inflammatory-induced 

state often leads to a deterioration in metabolic functions. In adipose tissue, 
palmitate and ceramides activate the NLRP3 inflammasome in infiltrating 
macrophages, which leads to an enhancement of insulin resistance. In addition, 
caspase-1 activation through an unknown sensor protein regulates adipocyte 


NLRP3-mediated caspase-1 activation in adipose-tissue-infiltrating 
macrophages of obese mice results in at least two deleterious conse- 
quences in insulin-sensitive tissues: first, IL-1 inhibits insulin signal- 
ling by direct serine phosphorylation of insulin receptor substrate 1 and 
induces the expression of tumour necrosis-factor-a, a well-character- 
ized insulin-resistance-promoting cytokine; and second, IL-1f and 
IL-18 induce type 1 CD4 T helper cells in adipose tissue”. 

Inflammasomes also seem to have adipocyte-intrinsic functions. 
Caspase-1 is upregulated during adipocyte differentiation, in which it 
promotes insulin resistance through autocrine IL-1B-mediated effects; 
however, perhaps more interestingly, caspase-1-deficient precursors 
differentiate more efficiently into mature adipocytes and have an 
increased oxidation rate”’ (Fig. 3), which suggests that adipocyte- 
specific proteins may serve as caspase-1 substrates, leading to func- 
tional changes in adipose tissue of obese animals. Hence, Nirp3" 3 
Asc’ and Casp1~" mice have been reported to have decreased weight 
gain and fat mass, as well as decreased insulin resistance. Interestingly, 
the loss of function of NLRP3 decreases but does not eliminate cas- 
pase-1 activation in adipose tissue or liver, suggesting that additional 
inflammasome sensors might contribute to the pathophysiology of 
obesity; furthermore, the nature of the stimuli that trigger caspase-1 
activation during the accumulation of fat in adipocytes remains to 
be determined. 


Inflammasome activation impairs §-cell function 

Chronic hyperglycaemia as a result of peripheral insulin resistance 
is compensated for by increased insulin output by pancreatic 6 cells. 
Local inflammatory processes coupled with the toxic effects of glu- 
cose lead to accelerated mass loss of 6 cells and decreased insulin 
secretion over time, which prompts the progression from obesity and 
insulin resistance to overt T2DM. IL-1, preferentially expressed by 
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differentiation and fatty acid oxidation. In the pancreas, [APP and increased 
mitochondrial ROS production activate the NLRP3 inflammasome in 
mmLDL-primed macrophages and f cells, respectively. The increased levels of 
IL-1 in pancreatic islets result in increased B-cell death and decreased insulin 
production. Minute cholesterol crystals in early atherosclerotic lesions activate 
the NLRP3 inflammasome in mmLDL-primed macrophages, promoting 
inflammatory-cell infiltration and increased atherosclerosis progression. 


pancreatic infiltrating macrophages and to a smaller extent by 6 cells, 
has been implicated as a critical driver of §-cell death in conditions of 
chronic exposure to elevated concentrations of glucose”. Additional 
support for a pathological role of inflammasomes in T2DM comes 
from a recent report showing that glyburide, an insulin-secretion- 
promoting drug, suppresses NLRP3-mediated IL-1 release”, and 
more importantly from human clinical trials of IL-1 receptor antago- 
nist (IL-1RA) to treat T2DM that demonstrated improved glycaemic 
control and B-cell function”’. 

In pancreatic islet failure during T2DM, NLRP3 inflammasomes may 
be activated by different mechanisms. First, hyperglycaemia triggers 
mitochondrial ROS production in 6 cells by increasing the activity of 
the electron transport chain, which induces thioredoxin-interacting 
protein (TXNIP) to dissociate from thioredoxin, leading to the activa- 
tion of NLRP3 (ref. 25) (Fig. 3 and Box 1). Second, the severity of T2DM 
is closely correlated with the levels of deposition of amyloid polypep- 
tide (IAPP, also known as amylin) in pancreatic islets’. Remarkably, 
IAPP has been shown to specifically activate the NLRP3 inflammasome 
in mouse pancreatic macrophages previously primed with the abun- 
dant minimally oxidized low-density lipoprotein (mmLDL) through 
a pathway that involves disruption of the phagolysosomal pathway”® 
(Fig. 3). Further evidence for the role of IAPP and NLRP3 activation 
in pancreatic islet deterioration is provided by the observation that 
macrophages express more IL-1 in pancreatic islets of human [A PP- 
transgenic mice”. Overall, secreted IL-1 can then signal in an auto- 
crine or paracrine manner to induce f-cell death or dysfunction and 
to promote the expression of chemotactic factors that further worsen 
immune-cell infiltration. It is important to mention that TXNIP is 
not required for NLRP3 activation in macrophages, strengthening 
the notion that the cellular milieu is a critical determinant of the way 
in which the NLRP3 inflammasome senses damage signals in the 
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context of metabolic abnormalities”. However, despite great advances 
in recent years, several questions remain unresolved, including the 
mechanisms of pro-IL-16 induction in 6 cells, the exact contribution 
of inflammasome activation in infiltrating macrophages and 6 cells 
to disease progression, and the potential roles of additional inflam- 
masome sensors in pancreatic inflammatory processes. 


Inflammasome activation and atherosclerosis 

The inflammatory nature of atherosclerosis is well established, but the 
biological agents that trigger artery wall inflammation remain largely 
unknown. Cholesterol crystal deposition in arterial vessels has long 
been a pathognomonic feature of atherosclerosis; moreover, recent evi- 
dence suggests that they are present at early stages of atherosclerotic 
lesions, coinciding with the first appearance of inflammatory cells”. 
Similarly to IAPP, cholesterol crystals activate the NLRP3 inflamma- 
some through phagolysosome destabilization in mmLDL-primed 
mouse and human macrophages” (Fig. 3). LDL-receptor-deficient 
mice (prone to atherosclerosis) reconstituted with bone marrow defi- 
cient in NLRP3, ASC or IL-1a/B are markedly resistant to the devel- 
opment of atherosclerosis, suggesting that NLRP3 inflammasome 
activation and IL-1 secretion from the haematopoietic compartment 
are key events in the early stages of disease”. Another model for 
atherosclerosis is the Apoe mouse, which develops severe hyper- 
cholesterolaemia and spontaneous atherosclerosis when being fed 
with a high-fat diet®®. Earlier reports have demonstrated that IL- 
1-receptor-deficient Apoe’” mice and IL-1RA-treated Apoe mice 
showed decreased atherosclerosis, suggesting a role for the inflam- 
masome in atherogenesis in this mouse model”. Interestingly, using 
mice deficient in both ApoE and different components of the NLRP3 
inflammasome, Menu et al.** showed that atherosclerosis progresses 
independently of the NLRP3 inflammasome in this context. The most 
reasonable explanations for the discrepancies between the two studies 
lie in the differences between atherosclerosis models (Apoe ' versus 
Ldlr’-) and ina putative role for IL-1a in the Apoe~ mouse model. 
However, it also raises the interesting possibility that environmen- 
tal factors could account for the contradictory phenotypes. Clearly, 
further research is warranted regarding the implication of the inflam- 
masomes in the pathogenesis of atherosclerosis and their role in non- 
haematopoietic cells during atherosclerotic plaque formation. 


Inflammasomes and the mucosal immune response 

In the gastrointestinal mucosa, the host is separated from an immense 
microbial ecosystem by only a single layer of epithelial cells. To avoid 
ongoing inflammatory reactions to commensal microbes and food 
antigens, while preserving the ability to react to pathogenic insults, 
mammals have evolved a complex mucosal immune system composed 
of epithelial and stromal cells acting together with subsets of cells of 
haematopoietic origin. These cells interact closely with each other 
and with the surrounding microbial milieu. When such interactions 
are perturbed, autoinflammation may develop, potentially leading to 
IBD. The ability of inflammasomes to recognize exogenous and endog- 
enous signals has led to several studies characterizing their role during 
chemical-induced intestinal autoinflammation, a model for human 
IBD. Interestingly, conflicting results have been reported by several 
groups (Fig. 4). Using a common acute and chronic epithelial injury 
colitis model based on the administration of dextran sulphate to mice, 
several groups reported decreased disease severity in mice deficient in 
caspase-1 or NLRP3, which correlated with lower IL-1$ production 
during disease***. Using the same model, other groups found that mice 
deficient in NLRP3, ASC and caspase-1 show an exacerbated disease 
severity” °°. In these reports, a role was suggested for the NLRP3 inflam- 
masome in the promotion of tissue regeneration in response to injury. 
Although variations in these results could be explained by differences in 
experimental designs, a recent paper” has offered an alternative expla- 
nation. This study reported that an NLRP6 inflammasome participates 
in the steady-state regulation of the commensal microflora. Deficiency 
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in this inflammasome is associated with an alteration in the microflora 
communities and the emergence of normally suppressed bacteria that 
have pro-inflammatory activity. It was suggested that regulation of the 
microflora by epithelial inflammasomes is mediated, at least in part, 
through the induction of basal secretion of IL-18 by epithelial cells. 
Discrepancies in the earlier studies may therefore be explained by differ- 
ences in microflora communities in different animal facilities, coupled 
with the inability of NURP6-inflammasome-deficient mice (Nirpo", 
Asc, Casp1~’) to regulate the microflora in a given facility. 

Phenotypic alterations may be induced by differences in animal 
housing; wild-type mice housed with NLRP6-inflammasome-deficient 
mice acquire their colitogenic microflora and develop phenotypes that 
may be profoundly different from that of wild-type mice housed alone. 
The factors inducing the formation of the NLRP6 inflammasome and 
the precise effector mechanisms for regulation of the microflora remain 
to be studied. In addition to NLRP6, other inflammasomes that are 
expressed mainly within the haematopoietic compartment, such as the 
NLRP3 inflammasome, may also function by regulating the microflora, 
as well as the autoinflammatory process itself. In the mucosal environ- 
ment, inflammasomes therefore function in the sensing of pathogens 
and the commensal flora by non-haematopoietic cells, such as the 
epithelial layers, but also by haematopoietic cells. They cooperate to 
maintain tolerance towards commensal microbes and to initiate a potent 
immune response towards pathogens (Fig. 4). Distinct inflammasomes 
expressed in different cell lineages may orchestrate these seemingly 
opposite functions during acute mucosal inflammation. 

It has been suggested that inflammasomes participate in 
inflammation-induced tumorigenesis, a common complication of 
chronic intestinal autoinflammation (such as IBD), yet mechanisms 
accounting for this regulatory effect remain elusive*”*’. Two recent 
reports suggest that NLRP6 deficiency may be associated with enhanced 
inflammation-induced tumorigenesis through impaired mucosal self- 
renewal and proliferation mediated by alterations in the intestinal stem- 
cell niche, or alternatively by an impaired haematopoietic immune 
response”””’, It remains to be determined whether the effect of NLRP6 
on tumorigenesis is a direct consequence of NLRP6 deficiency in one 
or more intestinal cell types or whether it is indirectly induced by the 
documented alterations in the regulation of microflora composition 
of these mice. 

Alternatively, another report suggests that epithelial NLRC4 
inflammasome activity may also participate in the prevention of intes- 
tinal inflammation-induced tumorigenesis, because NLRC4-deficient 
mice were more prone to the development of colonic neoplasms after 
the induction of chronic inflammation™. This effect was independent 
of inflammation or the composition of the intestinal microflora and 
was suggested to involve enhanced proliferation of epithelial cells and 
impaired apoptosis of colonic epithelial cells. The precise molecular 
mechanism linking the NLRC4 inflammasome dysfunction to increased 
tumorigenesis remains to be explored. 


Future perspectives 

The discovery of inflammasomes as protein platforms that control the 
processing of IL-1 and IL-18 constitutes a milestone in the innate 
immunology field. However, many questions remain unanswered, 
including those regarding the biochemical and genetic regulation of 
these protein complexes, as well as their roles in complex diseases. 
These, in our opinion, may be the focus of inflammasome research in 
coming years. 

The mechanism through which sensor proteins recognize activating 
signals remains a critical unresolved issue in the field. Although in vitro 
reconstitution assays with purified components have been performed 
with NLRP1 and AIM2 and have shown direct binding of the respec- 
tive ligands to both sensors, intense research has not been successful 
in identifying such direct interactions with any of the other inflam- 
masome sensors’ !”“?. Hence, it is plausible that as yet unidentified 
proteins may facilitate the recognition of structurally diverse activating 
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signals and ligands by NLRP3. Such ‘bridging’ mechanisms have been 
suggested to link cellular stress (namely potassium efflux, generation 
of ROS and lysosome destabilization) and microbial proteins to NLRP3 
(refs 21, 24, 25). However, the exact identity of these bridging mol- 
ecules remains to be fully characterized. Indeed, for the NLRC4 inflam- 
masomes, NAIP proteins were identified as additional components that 
are responsible for the specificity of microbial ligands recognized by 
the complexes“. Another open issue is the interesting observation that 
several phenotypes related to a deficiency of common inflammasome 
effectors and adaptors cannot be attributed to the well-known NLR 
and PYHIN molecular sensors and their ligands, indicating the exist- 
ence of additional sensors and ligands to be identified. An example of 
a recently identified sensor is IFI16, a sensor of viral DNA". 

The early discovery that mutations in inflammasome components 
in humans are the cause of several inherited autoinflammatory dis- 
eases that can be treated by neutralizing IL-1 illustrates the potential 
detrimental effects of overactivation of inflammasomes’’. However, 
recent evidence from mouse studies on the role of inflammasomes 
in mucosal immunology indicates a compartmentalized and dual 
function (detrimental versus beneficial). To dissect the contribu- 
tions of inflammasomes in different cell types and tissues, novel 
tools such as conditional gene-knockout mice will have to be devel- 
oped. This would allow better investigation of areas that bear funda- 
mental importance and present complex challenges to the research 
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Figure 4 | Inflammasomes regulate multiple 
aspects of tissue homeostasis and immune 
response in the mucosa. NLRP3, NLRP6 
and NLRC4 inflammasomes are formed 

in many lineages of haematopoietic and 
non-haematopoietic cells present in the 
intestine. The activity of these inflammasomes 
regulates epithelial cell regeneration, the 
microflora and mucosal inflammation. All of 
these processes are interconnected, and the 
dysregulation of any of them may increase 

the risk of tumorigenesis. a, Epithelial cell 
regeneration: recognition of as yet unknown 
signals leads to the processing of pro-IL-18 
and potentially unidentified effectors through 
NLRP3, NLRP6 and NLRC4. This allows the 
regulation of epithelial regeneration on injury, 
through effects on epithelial cells, lamina 
propria immune cells, myofibroblasts and 
intestinal epithelial stem cells. b, Regulation 
of microflora: during the steady state, the 
production of IL-18 through NLRP3 and 
NLRP6 in the colon participates in the 
regulation ofa ‘healthy’ microflora, potentially 
through regulation of antimicrobial peptide 
secretion by epithelial cells and immune 

cell subsets. Disruption of the NLRP6 
inflammasome results in aberrant regulation 
of the intestinal microflora, leading to the 
emergence of pro-colitogenic bacteria. c, 
Inflammation: after breaching of the intestinal 
epithelial barrier, the NLRP3 inflammasome 
is involved in the regulation of intestinal 
inflammation in response to microbial 
invasion. Both IL-1 and IL-18 are suggested 
to act as effector molecules, leading to the 
recruitment of multiple cell subsets, including 
neutrophils, that promote local inflammation. 


Chemokines 
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community, such as the involvement of inflammasome signalling 
in circuits regulating metabolism, inflammation and the intestinal 
microflora in mice and humans”. 

Itis worth noting that NLRP6 inflammasome deficiency was recently 
shown to associate with expansion of an autoinflammation-promoting 
microbiota including bacterial taxa such as Prevotellaceae and TM7 
(ref. 91). The mechanisms by which inflammasomes sense microflora 
members or communities, regulate tissue repair and regeneration, 
and orchestrate mucosal immune responses during the steady state 
and inflammation remain to be fully characterized. Interestingly, both 
inflammasome deficiency and alterations in the intestinal microflora 
were previously linked in separate studies to a propensity for the devel- 
opment of elements of the metabolic syndrome, such as obesity and 
atherosclerosis, in mice and humans”””*. Indeed, recent characterization 
of the microflora community in a cohort of individuals of European 
origin identified the bacterial family Prevotellaceae as a prominent 
component of one human microflora cluster that featured a trend 
towards increased body mass index”. Moreover, faecal contents of rural 
African children featured expanded representation of Prevotellaceae 
that was absent in their European counterparts, probably reflecting a 
combination of genetic and environmental factors that differentially 
regulate the microflora composition’®’. Whether the regulation of the 
gut flora by inflammasomes affects weight, metabolism and inflamma- 
tion is expected to become a major interest of the field. 
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Whereas the first decade of inflammasome research has been 
characterized mostly by in vitro studies and in vivo studies with small 
animal models, the contribution of altered inflammasome function 
to complex human disease is expected to gradually take centre stage. 
This, together with a vast effort to discover and develop small-mol- 
ecule approaches to the specific inhibition of inflammasome activa- 
tion, may pave the way to therapeutic intervention targeting inflam- 
masome-regulated pathways that are involved in the pathogenesis of 
human disease. m 
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The DNA damage response 


and cancer therapy 


Christopher J. Lord’* & Alan Ashworth™* 


Genomic instability is one of the most pervasive characteristics of tumour cells and is probably the combined effect of 
DNA damage, tumour-specific DNA repair defects, and a failure to stop or stall the cell cycle before the damaged DNA is 
passed on to daughter cells. Although these processes drive genomic instability and ultimately the disease process, they 
also provide therapeutic opportunities. A better understanding of the cellular response to DNA damage will not only inform 
our knowledge of cancer development but also help to refine the classification as well as the treatment of the disease. 


(1862-1915) published a prophetic and much-quoted manu- 
script on the genetic basis of cancer’” that in contemporary 
terms’ would be hailed as a great example of ‘translation’ or ‘impact’ In 
his study of chromosome and cell behaviour in the sea urchin, Boveri 
suggested that the pathogenesis of cancer might be driven by a “specific 
”? A century later, genomic 


ki the end of his life, the German biologist Theodor Boveri 


and abnormal chromosome constitution’ 
instability is recognized as a characteristic of most solid tumours and 
adult-onset leukaemias. In most cancers, the instability is obvious as 
alterations in chromosome number and structure, a phenotype termed 
chromosomal instability, and as changes to the structure of DNA, such 
as nucleotide substitutions, insertions and deletions. When they occur 
in crucial ‘driver’ genes (of which there are probably fewer than ten 
per tumour) these mutations can alter cell behaviour, confer a selective 
advantage and drive the development of the disease. Importantly, these 
mutations can also influence how the tumour will respond to therapy. 
Alongside key driver mutations, emerging data from cancer genome 
sequencing suggests that a typical tumour may contain many thousands 
of other genetic changes. These ‘passenger’ mutations do not contribute 
directly to the disease, but are probably collateral damage from exposure 
to various environmental factors or defects in the molecular mecha- 
nisms that maintain the integrity of the genome. 


DNA damage and the DNA damage response 

Given the potentially devastating effects of genomic instability, cells 
have evolved an intricate series of interlocking mechanisms that main- 
tain genomic integrity’. The size of this task is daunting; the integrity 
of DNA is continually challenged by a variety of agents and processes 
that either alter the DNA sequence directly or cause mutation when 
DNA is suboptimally repaired. For example, the ultraviolet component 
of sunlight can cause up to 1 x 10° DNA lesions per cell per day*, many 
of which are pyrimidine dimers. If left unrepaired, dimers that contain 
cytosine residues are prone to deamination, which can ultimately result 
in cytosine being replaced with thymine in the DNA sequence. Likewise, 
ionizing radiation (for example, from sunlight or cosmic radiation) can 
cause single-strand breaks (SSBs) and double-strand breaks (DSBs) in 
the DNA double helix backbone. If misrepaired — for example, the 
inaccurate rejoining of broken DNA ends at DSBs — these breaks 
can induce mutations and lead to widespread structural rearrange- 
ment of the genome. Environmental agents that cause DNA damage 
and mutation include cigarette smoke (40 years of heavy smoking is 
estimated to cause as many as 1,000 DNA lesions per cell°), industrial 


chemicals, mustard gases used in warfare and a wide variety of drugs 
used in chemotherapy. This final group of agents has been the mainstay 
of cancer therapy for decades and includes drugs that induce covalent 
crosslinks between DNA bases (for example, cisplatin, carboplatin, 
oxaliplatin and mitomycin C), drugs that attach alkyl groups to bases 
(such as methyl methanesulphonate and temozolomide) and agents 
that cause SSBs or DSBs by trapping topoisomerase I or II enzymes on 
DNA (such as camptothecin and etoposide)’. Endogenous processes 
also present a challenge to the integrity of the genome. For example, 
normal cellular metabolism generates reactive oxygen species that can 
oxidize DNA bases and cause SSBs. Similarly, errors in DNA replication 
caused by deoxyribonucleoside 5 ‘-triphosphate (ANTP) misincorpora- 
tion are potentially mutagenic, as is base depurination and deamination, 
an intrinsic chemical liability of DNA’. 

The variety and frequency of DNA lesions are matched by the 
complexity of mechanisms that counteract these threats to genomic 
integrity. Collectively, these mechanisms are known as the DNA dam- 
age response (DDR). In general terms, the DDR can be divided into 
a series of distinct, but functionally interwoven, pathways, which are 
defined largely by the type of DNA lesion they process (Fig. 1). Most 
DDR pathways encompass a similar set of tightly coordinated processes: 
namely the detection of DNA damage, the accumulation of DNA repair 
factors at the site of damage and finally the physical repair of the lesion. 
Most of the subtle changes to DNA, such as oxidative lesions, alkylation 
products and SSBs, are repaired through a series of mechanisms that 
is termed base excision repair (BER). In BER, damaged bases are first 
removed from the double helix, and the ‘injured’ section of the DNA 
backbone is then excised and replaced with newly synthesized DNA®. 
Key to this process are the enzymes poly(ADP-ribose) polymerase 1 and 
2(PARP1 and PARP2), which act as sensors and signal transducers for 
lesions, such as SSBs. In addition to BER, the pool of deoxynucleotides 
(deoxyadenosine triphosphate (dATP), deoxythymidine triphosphate 
(dTTP), deoxyguanosine triphosphate (dGTP) and deoxycytidine 
triphosphate (dCTP)) that provide the building blocks of DNA can be 
chemically modified before they are incorporated into the double helix. 
The nucleotide pool is, therefore, continually ‘sanitized’ by enzymes 
such as nudix-type motif 5 (NUDTS5)’. 

Whereas small base adducts are repaired by BER, some of the bulkier 
single-strand lesions that distort the DNA helical structure, such as 
those caused by ultraviolet light, are processed by nucleotide excision 
repair (NER)*. NER is often subclassified into transcription-cou- 
pled NER, which occurs where the lesion blocks, and is detected 
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Figure 1 | A panoply of DNA repair mechanisms 
maintains genomic stability. DNA is continually 
exposed to a series of insults that cause a range of 
lesions, from single-strand breaks (SSBs) to base 
alkylation events. The choice of repair mechanism 
is largely defined by the type of lesion, but factors 
such as the stage of the cell cycle also have a role. 
Key proteins involved in each DDR mechanism, 
the tumour types usually characterized by DDR 
defects and the drugs that target these defects are 
shown. BER, base excision repair; NER, nucleotide 
excision repair; NHEJ, non-homologous end- 
joining. Figure modified, with permission, from 


Base alkylation 


ref. 72. 
BER Homologous NHEJ NER Mismatch Direct reversal 
| recombination | | repair | 
Proteins PARP1 BRCA1 KU70/80 ERCC4 MSH2 MGMT 
XRCC1 BRCA2 DNA-PK ERCC1 MLH1 
LIGASE 3 PALB2 
ATM 
CHEK1 
CHEK2 
RAD51 
Tumour types Breast, ovarian, pancreatic Xeroderma Colorectal Giloma 
pigmentosa 


Drugs PARP inhibitors, platinum salts 


by elongating RNA polymerase; and global-genome NER, in which 
the lesion is detected not as part of a blocked transcription process but 
because it disrupts base pairing and distorts the DNA helix. Although 
these processes detect lesions using different mechanisms, they repair 
them ina similar way: DNA surrounding the lesion is excised and then 
replaced using the normal DNA replication machinery. Excision repair 
cross-complementing protein 1 (ERCC1) is key to this excision step. 

The major mechanisms that cope with DSBs are homologous 
recombination’ and non-homologous end joining (NHEJ)"’. Homol- 
ogous recombination acts mainly in the S and G2 phases of the cell 
cycle and is a conservative process in that it tends to restore the original 
DNA sequence to the site of damage. Part of the DNA sequence around 
the DSB is removed (known as resection) and the DNA sequence on 
a homologous sister chromatid is used as a template for the synthesis 
of new DNA at the DSB site. Crucial proteins involved in mediating 
homologous recombination include those encoded by the BRCA1, 
BRCA2, RAD51 and PALB2 genes. In contrast to homologous recom- 
bination, NHEJ occurs throughout the cell cycle. Rather than using a 
homologous DNA sequence to guide DNA repair, NHEJ mediates repair 
by directly ligating the ends of a DSB together. Sometimes this process 
can cause the deletion or mutation of DNA sequences at or around the 
DSB site. Therefore, compared with homologous recombination, NHEJ, 
although mechanistically simpler, can often be mutagenic. 

Mismatch repair" is crucial to the DDR. It deals primarily with dNTP 
misincorporation and formation of ‘insertion and deletior loops that 
form during DNA replication. These errors cause base ‘mismatches’ in 
the DNA sequence (that is, non-Watson-Crick base pairing) that distort 
the helical structure of DNA and so are recognized as DNA lesions. The 
recognition of this distortion triggers a procession of events resulting in 
the excision of newly synthesized DNA encompassing the mismatch site 
and the resynthesis of DNA in its place. Key to the process of mismatch 
repair are proteins encoded by the mutS and mutL homologue genes, 
such as MSH2 and MLH1. 

Finally, translesion synthesis and template switching allow DNA to 
continue to replicate in the presence of DNA lesions that would oth- 
erwise halt the process. Translesion synthesis and template switching 
are therefore usually considered to be part of the DDR. In translesion 
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synthesis, relatively high-fidelity DNA replication polymerases are tran- 
siently replaced with low-fidelity ‘translesion polymerases that are able 
to synthesize DNA using a template strand encompassing a DNA lesion. 
Once the replication fork passes the site of the lesion, the low-fidelity 
DNA polymerases are normally replaced with the usual high-fidelity 
enzyme, which allows DNA synthesis to continue as normal. In template 
switching, the DNA lesion is bypassed at the replication fork by simply 
leaving a gap in DNA synthesis opposite the lesion. After the lesion 
has passed the replication fork, the single-strand gap is repaired using 
template DNA ona sister chromatid, similar to the process used during 
homologous recombination. 

Although sometimes considered distinct from the DDR, the 
mechanisms that control the integrity of telomeric DNA at the end of 
each human chromosome also act as a barrier against genomic instabil- 
ity and mutation”, Rather than being an exposed DNA double-helix 
structure at the end of the chromosome, telomeric DNA comprises 
a series of guanine-rich, repetitive DNA sequences. These enable the 
telomeric DNA to be bound in a loop-like structure with a series of 
proteins (telomere repeat binding factor 1 and 2 (TERF1 and TERF2), 
protection of telomeres 1 (POT1), TERF1-interacting nuclear factor 
2 (TINF2)) that form a shelterin complex. This ‘capping’ structure 
prevents the otherwise exposed ends of different chromosomes from 
becoming fused together (a process known as end-end fusion) by the 
DDR. In most somatic cells, the length of telomeric DNA is reduced 
at each cell cycle, a process termed telomere attrition. Eventually, tel- 
omeres reach a crucial length that precludes the formation of an effec- 
tive shelterin complex. In normal cells, this failure in telomere capping 
induces a p53-mediated response that results in cellular senescence, a 
mechanism that ultimately prevents unlimited cell proliferation. When 
the p53 response is abrogated, end-end fusions occur, which leads to 
the formation of chromosomes with two centromeres. At mitosis, each 
centromere in a dicentric chromosome attaches to an opposite spindle 
pole. The physical stress of opposing forces during chromosome segre- 
gation shears dicentric chromosomes, resulting in broken chromosome 
ends. This process is an ideal substrate for the chromosome translo- 
cation, focal DNA amplification and deletion events that potentially 
drive tumorigenesis. Sheared chromosomes could also be the source of 
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the mutagenic ‘big bang’ or, chromothripsis”’, that has been posited to 
occur in some tumours. Whether or not this is the case, compared with 
healthy cells, many pre-invasive and invasive cancers have significantly 
shorter telomeres, a situation in which end-end fusions and dicentric 
chromosome shearing could easily occur”. 

The core DDR machinery does not work alone, but is coordinated 
with a set of complementary mechanisms that are also crucial to main- 
taining the integrity of the genome. For example, chromatin-remod- 
elling proteins allow the DNA repair apparatus to gain access to the 
damaged DNA". Similarly, processes such as homologous recombina- 
tion are dependent on the close proximity of a sister chromatid, and 
probably work in tandem with the chromosome cohesion machinery 
to ensure this is the case. Crucially, DDR core components interact with 
the cell-cycle-checkpoint and chromosome-segregation machinery. 
These interactions allow DNA repair to occur before mitosis takes place 
and ensure that the correct complement of genetic material is passed 
on to daughter cells’. 


Cancer 

One of the most pervasive characteristics of human tumours is genomic 
instability’®. Although the specific DDR defects are not known in most 
cancers, there are several examples in which there is an incontrovertible 
link between a particular DDR dysfunction and the neoplastic pheno- 
type. For example, 15% of sporadic colorectal tumours show an abnor- 
mal shortening or lengthening of dinucleotide repeat sequences. This 
DNA mutation pattern, known as microsatellite instability, is probably 
caused by an inability to repair DNA replication errors when mismatch 
repair is defective. Microsatellite instability is observed not only in spo- 
radic colorectal tumours but also in a familial form of the disease known 
as hereditary non-polyposis colorectal cancer (HNPCC). HNPCC is 
associated with loss-of-function mutations in mismatch repair genes, 
such as MSH2 and MLH1 (ref. 11). 

Similarly, a subset of both sporadic and familial cancers seems to 
be associated with defects in homologous recombination. Integrated 
(genomic, transcriptomic, methylomic) analysis of just under 500 sam- 
ples of high-grade serous ovarian adenocarcinoma has estimated that 
homologous recombination is probably defective in approximately half. 
These homologous recombination defects are largely driven by DNA 
mutations or epigenetic silencing of genes such as BRCA1 and BRCA2 
(ref. 17). Familial forms of breast, ovarian and pancreatic cancer are also 
associated with loss-of-function mutations in homologous recombina- 
tion modifying genes, such as BRCA1, BRCA2, PALB2, ATM, RAD51C 
and RAD51D. A number of rare, syndromic conditions also exist in 
which the underlying genetic determinant is a loss-of-function muta- 
tion in a gene that modulates the response to DSBs’. These conditions 
include ataxia telangiectasia (ATM), Bloom's syndrome, Fanconi anae- 
mia, Rothmund-Thomson syndrome, Werner syndrome and Nijmegen 
breakage syndrome, all of which have variable clinical phenotypes but 
are similar in that they cause extreme radiosensitivity, a characteristic 
of the inability to process DSBs effectively. 

Evidence is also building that the DDR is not only invoked but also 
dysfunctional at an early stage in the development of neoplasia. Markers 
of DSBs, such as nuclear yH2AX foci (a histone phosphorylation event 
that occurs on chromatin surrounding a DSB), are markedly elevated 
in some precancerous lesions'*’’. One hypothesis” proposes that the 
original cause of these effects is oncogene activation. The activation 
of oncogenes such as MYC and RAS stimulates the firing of multiple 
replication forks as part of a proliferative program. These forks rap- 
idly stall, collapse and form DSBs because they exhaust the available 
dNTP pool or because multiple forks collide on the same chromosome. 
Regardless of the mechanism, stalled and collapsed forks normally 
invoke the DDR and cell-cycle checkpoints that enable DNA lesions 
to be repaired before mitosis takes place. For precancerous lesions to 
progress to mature tumours, it is thought that critical DSB signal trans- 
duction and cell-cycle checkpoint proteins, such as ATM, ataxia telangi- 
ectasia and Rad3 related (ATR), and the master ‘gatekeeper’ protein p53 
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become inactivated. With these DDR components rendered dysfunc- 
tional, collapsed forks are not effectively repaired, and cells proceed 
through the cell cycle with DNA lesions intact, increasing the chance 
of mutagenesis’*””. 


Selection of cancer treatment 

Historically, DDR defects have already been exploited in the form of 
many commonly used chemotherapy and radiotherapy regimens. 
One example is the use of platinum salts (carboplatin or cisplatin) 
frequently given in combination with the taxane paclitaxel in patients 
with advanced ovarian cancer. Platinum salts cause DNA inter- and 
intrastrand crosslinks — lesions that are recognized by the DDR and 
repaired by a combination of NER and homologous recombination. 
These agents may be effective in patients with ovarian cancer because 
these tumours commonly harbour defects in homologous recombina- 
tion”, especially high-grade serous ovarian cancers. Estimates vary, 
but somewhere between 37% and 55% of high-grade serous ovarian 
cancers have germ line or somatic mutations in BRCA1 or BRCA2 and 
some have epigenetic inactivation of BRCA1 (ref. 21). Cisplatin also 
improves survival in patients with ERCC1-negative non-small-cell 
lung cancers, but not in those with ERCC1-positive tumours”. The 
NER pathway, and in particular the ERCC1-ERCC4 complex, is essen- 
tial for the removal of platinum adducts, and low levels of ERCC1 are 
associated with an accumulation of adducts and an increased apop- 
totic response to cisplatin. Finally, where alkylating agents work, they 
probably also exploit a DDR defect. Normally, 06-methyltransferase 
(MGMT) directly reverses DNA alkylation events caused by drugs 
such as temozolomide when they occur on the O6 position of guanine 
residues. However, in some tumours, the promoter of the MGMT gene 
is methylated, causing a reduction in MGMT expression. This defect 
causes O* alkylation events formed by temozolomide to persist so that 
they eventually pair incorrectly with thymidine residues during replica- 
tion. The resultant DNA mismatch is recognized by the mismatch repair 
apparatus, which ultimately stimulates cell death. MGMT methylation 
has also been implicated in the response of tumours (such as gliomas) 
to temozolomide, and is now being used as a biomarker to direct the 
use of therapy” (Table 1). 

Analternative approach to chemotherapy is the design of drugs that 
target specific DDR components. Topoisomerase inhibitors, such as 
irinotecan (a topoisomerase I inhibitor) and etoposide (a topoisomer- 
ase II inhibitor), could be regarded as the first generation of DDR ‘tar- 
geted’ agents™. Topoisomerases catalyse the breaking and rejoining of 
the phosphodiester backbone as part of the process that unwinds the 
torsional structure of DNA before processes such as transcription and 
DNA replication; agents that inhibit this function, such as irinotecan 
and etoposide, leave DNA breaks across the genome and have been 
used effectively in some cancers (Table 1). The development of the next 
generation of DDR inhibitors, particularly PARP inhibitors, has given 
considerable impetus to this area. 


PARP inhibitors as targeted therapy 

PARP1 and PARP2 are members of the PARP protein superfamily. 
These enzymes catalyse the polymerization of ADP-ribose moieties 
onto target proteins (PARsylation) using NAD* as a substrate, releas- 
ing nicotinamide in the process. This modification often modulates 
the conformation, stability or activity of the target protein™. The best 
understood role of PARP1 is in SSB repair ((SSBR) a form of BER). 
PARP1 initiates this process by detecting and binding SSBs through 
a zinc finger in the PARP protein. Catalytic activity of PARP1 results 
in the PARsylation of PARP] itself and the PARsylation of a series of 
additional proteins, such as XRCC1 and the histones H1 and H2B; when 
PARP activity is inhibited, SSB repair is compromised”. The develop- 
ment of PARP inhibitors began with the observation that nicotinamide, 
a product of PARP catalytic activity, is itself a weak PARP inhibitor, as 
are nicotinamide analogues, such as 3-aminobenzamide. The screen- 
ing of chemical libraries and the subsequent chemical refinement 
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Table 1 | Examples of DDR inhibitors in clinical use or in development 


Class Name Stage of development Cancer type 
Topoisomerase | lrinotecan Licensed for use Mainly colorectal cancers 
inhibitor 
Topoisomerase II Etoposide Licensed for use For example, testicular cancer, Ewing’s sarcoma, lung cancer, lymphoma, 
inhibitor phosphate lymphocytic leukaemia and glioblastoma multiforme 
DNA protein kinase CC-115 Phase | study under way Advanced solid tumours, non-Hodgkin’s lymphoma and multiple myeloma 
inhibitor 
PARP inhibitor Olaparib 2 phase II studies completed, 9 additional Ovarian, breast, gastric, colorectal and pancreatic tumours and a range of 
phase II studies under way other solid tumours 
Veliparib 16 phase II studies under way Ovarian, breast, gastric, colorectal and pancreatic tumours and a range of 
other solid tumours 
Rucaparib 2 phase II studies recruiting Breast and other solid tumours 
MK4827 Phase | study completed, phase |b Glioblastoma multiforme, melanoma and other solid tumours and 
ongoing leukaemias 
CHEK1 inhibitor AZD7762 Phase | study completed Range of solid tumours 


MGMT inhibitor Lomeguatrib Phase! study under way 


Telomerase inhibitor —Imetelstat 4 phase II studies under way 


Colorectal cancer and other solid tumours 


Solid tumours and leukaemias 


of compounds have led to second generation PARP inhibitors (for 
example, the benzimidazole-4-carboxamide NU1025), and to a third 
generation of clinically usable inhibitors, such as the tricyclic lactam 
indole, AG014699, and phthalazinones, such as AZD2281 (olaparib)”*. 

For cancer therapy, PARP inhibitors are potent chemosensitizers. For 
example, 3-aminobenzamide enhances the cytotoxic effect of the DNA- 
methylating agent dimethyl sulphate, and several preclinical studies 
have shown that clinical PARP inhibitors increase the effects of another 
DNA-methylating agent, temozolomide*”®. In 2005, potent PARP 
inhibitors were shown to selectively inhibit the growth of cells with 
defects in either the BRCA1 or BRCA2 genes, suggesting a new use for 
these agents”””*. PARP inhibition probably works by allowing the per- 
sistence of spontaneously occurring SSBs, or by inhibiting PARP release 
from a DNA lesion. Whichever is the case, both of these DNA lesion 
types could conceivably stall and collapse replication forks, potentially 
creating lethal DSBs”. In normal cells, the effects of PARP inhibition 
are buffered by homologous recombination, which repairs the result- 
ant DSB. However, effective homologous recombination is reliant on 
functioning BRCA1 and BRCA2, so when these genes are defective — as 
they are in tumours of germline BRCA-mutant carriers — DSBs are left 
unrepaired, and potent PARP inhibitors can cause cell death. In some 
in vitro models, cells with BRCA mutations are more than 1,000 times 
more sensitive to PARP inhibitors than cells with wild-type alleles”. 
This genotype-specific selectivity provides a strong argument for these 
agents to be tested clinically as single agents, rather than in combina- 
tion with chemotherapies. Moreover, the sensitivity to PARP inhibi- 
tors seems to be defined more by the BRCA genotype of a cancer cell 
than by its tissue of origin. Breast, ovarian and prostate cancers with 
BRCA mutations all seem to be profoundly sensitive to these drugs. The 
demonstration that BRCA-mutant cells are sensitive to PARP inhibi- 
tors has now become the emblematic ‘poster child’ for exploiting syn- 
thetic lethality in cancer therapy. Synthetic lethality, a genetic concept 
developed by Dobzhansky”, Lucchesi”! and others and championed as 
a therapeutic approach for cancer by Hartwell and colleagues” and Kae- 
lin®, is the process in which different single gene defects are compatible 
with cell viability, but the synthesis or combination of these gene defects 
results in cell death. In this case, BRCA dysfunction is synthetically 
lethal when PARP is inhibited. Furthermore, the demonstration that 
one DDR pathway (homologous recombination) could be targeted by 
inhibition of another (PARP-mediated BER and SSB repair) suggests 
that DDR pathways functionally buffer one another. This hypothesis 
has long been established in model organisms (such as yeast) in which 
synthetic lethalities can be easily identified using genetic screens. These 
studies and the BRCA/PARP paradigm in human tumour cells has 
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inspired the search for other DDR synthetic lethalities that could be 
exploited in the treatment of cancer, including other synthetic lethal 
approaches for BRCA-mutant tumour cells™, as well as for tumour 
cells with other gene defects. For example, the mismatch repair gene 
defects found in colorectal cancer are synthetically lethal with inhibi- 
tion of genes controlling the response to oxidative DNA damage, such 
as POLB”’, POLG” and PINK] (ref. 36). The first therapeutic test of the 
hypothesis that mismatch-repair defective cancers can be treated with 
a synthetic lethality approach” is now in a proof-of-concept clinical 
trial (http://www.clinicaltrials.gov, NCT00952016). 

The preclinical work described above has paved the way for subsequent 
clinical studies of PARP inhibitors (Table 1). Phase I studies have estab- 
lished the safety of olaparib, a potent PARP inhibitor, as a single agent 
and shown that significant and durable antitumour responses can be 
established in patients with BRCA-mutant breast, ovarian or prostate 
tumours**. Furthermore, olaparib does not seem to cause many of the 
side effects associated with standard chemotherapies. In subsequent 
phase I] clinical studies, 40% of patients with breast or ovarian cancer 
with germline BRCA mutations had a favourable response to the drug. 
This is a particularly high response given that the patients in these tri- 
als had been heavily pretreated and had become resistant to a range 
of chemotherapies. Such work has led to a series of additional PARP 
inhibitors being tested in clinical trials”. 

Recently, attention has focused on clinical trials assessing iniparib. 
Unlike drugs such as olaparib and AG014699, iniparib was originally 
proposed to be a non-competitive inhibitor of PARP1 that disrupts the 
interaction between PARP1 and DNA”. Ina phase II trial, iniparib 
elicited a significant response in patients with triple negative (oestro- 
gen-receptor, progesterone-receptor and HER2-amplification nega- 
tive) breast cancer when used in combination with gemcitabine and 
carboplatin. However, a subsequent phase III trial failed to deliver 
the predefined improvements in progression-free or overall survival. 
The reasons for this failure are unclear, although differences in the 
patient populations and study design may have contributed. However, 
the clinical activity of iniparib may not be related to PARP inhibition”; 
this drug has very low PARP inhibition in vitro, and its mechanism 
of action in vivo is unclear***'. We therefore believe that there is lit- 
tle evidence to regard the relative failure of iniparib in phase III trials 
as a PARP-inhibitor class effect. In fact, the clinical observations with 
potent catalytic-site PARP inhibitors have been promising when used 
in appropriate populations”. 

Alongside the clinical trials of PARP inhibitors, preclinical research 
has extended our understanding of how these agents work and how they 
might be used most effectively in the clinic. Since the initial reports of 
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BRCA/PARP synthetic lethality, several other determinants of PARP- 
inhibitor sensitivity have been identified. As expected, most of these 
determinants cause PARP-inhibitor sensitivity by modulating homolo- 
gous recombination (for example, RAD51, ATRX, SHFM1, RPA1, NBN, 
ATR, ATM, CHEK1, CHEK2, miR-182, CDK1, SWI5-SFR1, USP 1/ 
UAF1 and several Fanconi anaemia proteins” ”). Of particular interest 
is the observation that cyclin dependent kinase 1 (CDK1) inhibition 
suppresses homologous recombination and may therefore elicit PARP- 
inhibitor sensitivity both in vitro and in vivo". More extensive studies 
are now required to identify the patient subgroup, and specifically the 
genetic context, in which this combination could be used. In addition, 
mutations in the tumour-suppressor gene PTEN™ and the TMPRSS2- 
ERG fusion’ can sensitize cells to clinical PARP inhibitors; both of these 
changes are relatively common in human tumours so could be used as 
biomarkers to direct the use of these agents. In the case of the PTEN/ 
PARP synthetic lethality, one model suggests that PTEN deficiency 
causes a downregulation in RAD51 levels, therefore disrupting homol- 
ogous recombination”. Finally, several PARP-inhibitor-resistance- 
causing mechanisms have been identified, particularly in BRCA1- or 
BRCA2-mutant tumour models. In two studies, PARP-inhibitor resist- 
ance was caused by secondary mutations in the BRCA1 or BRCA2 genes 
themselves””*'. These mutations restored the open reading frame of the 
BRCAI or BRCA2 gene, enabled the generation of functional proteins 
and restored the ability to repair DNA damage caused by PARP inhibi- 
tors. Importantly, these secondary mutations were observed not only 
in in vitro models but also in ovarian tumours from patients who had 
become resistant to platinum salts such as carboplatin” >’. These drugs 
target cells with deficient homologous recombination, albeit with less 
efficiency than PARP inhibitors. Subsequent studies have suggested 
that these reversion events are relatively common in BRCA-mutant 
cancers that do not respond to platinum salts. This is especially true 
in ovarian cancer, in which more than 42% of the carboplatin-resistant 
tumours tested had secondary mutations that restored the BRCA1 or 
BRCA2 open reading frames™. Taken together, these observations could 
help to explain why some platinum-refractory BRCA-mutant patients 
in PARP-inhibitor clinical trials failed to respond and strengthen the 
argument for the inclusion of biomarkers that monitor BRCA1 and 
BRCA2 sequence and homologous recombination function in future 
trials. Furthermore, the restoration of homologous recombination and 
drug resistance by secondary mutations provides strong confirmation 
that, at least in this setting, PARP inhibitors and platinum salts require 
defective homologous recombination for antitumour activity. Other 
identified mechanisms of PARP-inhibitor response include the upregu- 
lation of genes that encode P-glycoprotein efflux pumps” and the loss 
of TP53BP1 (a DDR protein) in BRCA1-deficient cells, suggesting that 
these mechanisms should also be assessed in clinical biopsies”. 

In addition to dissecting the mechanisms behind PARP-inhibitor 
sensitivity and resistance, there has been considerable debate regard- 
ing the potential of PARP inhibitors in non-familial cancers. Germline 
mutations in BRCA1 or BRCA2 are common in hereditary breast 
cancer, but relatively rare in sporadic breast cancer. However, many 
sporadic tumours have molecular and histopathological features in 
common with BRCA-deficient tumours, a concept originally termed 
BRCAness”™. For example, BRCAness may be caused by methylation 
of the BRCAI gene promoter (which causes epigenetic silencing of 
BRCAI expression) or even somatic mutation of genes that modulate 
homologous recombination. Furthermore, traditional histopathologi- 
cal methods and gene-expression profiling approaches have suggested 
some level of phenotypic or molecular overlap between triple-negative 
breast cancers, basal-like breast cancers and BRCA 1-mutant familial 
breast cancers”. Although the overlap is not absolute, it has led to the 
hypothesis that there may be a subset of sporadic breast cancers that 
exhibit features of BRCAness and may respond favourably to PARP 
inhibitors”’. This hypothesis has yet to be fully tested experimentally, 
in part because existing tumour-cell models of triple-negative breast 
cancer do not encompass the molecular and functional heterogeneity 
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of triple-negative breast cancer. In the clinical setting, patients with 
triple-negative breast cancer do not show a frequent response to either 
olaparib or veliparib”, although the number of patients tested is still 
small. Needless to say, larger studies are needed, as are clinical trials 
that lead to a full drug licence for PARP inhibitors in BRCA-mutation- 
related breast cancer. 


Future issues 

More research is needed into how DDR activity affects the clinical 
efficacy of DNA-damaging chemotherapies, and here we highlight some 
areas that need to be addressed in the future. 


Pharmacological targeting 
Although the DDR has been the subject of intense study, the biochemical 
and genetic interactions between the various DNA repair pathways is 
not completely understood. This understanding would be facilitated 
by the availability of potent and specific DDR-pathway small-molecule 
inhibitors. Past efforts have focused on the protein kinases that trans- 
duce DDR signals (such as ATM, ATRand DNA-PK) and on the PARP 
superfamily members that detect damage and act as signal transduction 
mediators. For example, synthetic lethalities with ATM and DNA-PK 
suggest that ATM inhibitors could increase the potential of DNA-dam- 
aging chemotherapy in TP53-mutant tumours, and DNA-PK inhibi- 
tors could be used in combination with chemotherapy in ATM-mutant 
tumours”. A number of other enzyme classes have also been identified 
as having a role in the DDR, including enzymes that mediate SUMOyla- 
tion, histone modification, acetylation and ubiquitylation”. The avail- 
ability of pharmacological inhibitors of these proteins would facilitate 
the search for additional synthetic lethal interactions. An example is 
the identification of small-molecule inhibitors (such as MLN4924) that 
modulate neddylation, a post-translational modification process analo- 
gous to ubiquitylation. Inhibition of neddylation, which is induced by 
MLN4924, causes stabilization of the DNA-replication licensing factor 
CDT1, DNA synthesis in the absence of cell-cycle progression and acti- 
vation of the DDR. These agents could potentially target tumour cells 
that have existing DNA repair defects or are under replicative stress. 
The clinical use of DDR-pathway drugs remains a challenge. Addition 
of such agents to standard chemotherapy regimens will probably lead to 
greater side effects, and therefore more work in this area is needed on the 
appropriate dosing and scheduling of inhibitors. These studies should 
also address the possibility that targeting the DDR could also increase 
the rate of second cancers or leukaemias, something that has already 
been observed for some chemotherapies. In principle, these iatrogenic 
effects might be avoided if DDR inhibitors could be administered for 
a relatively short duration, a hypothesis that should be tested clinically 
as well as in preclinical models. Finally, combining DDR inhibitors with 
other targeted agents, such as signal transduction inhibitors or targeted 
radiation, has been underexplored and could form the basis of effective 
treatment regimens. 


Cancer stem cells and the tumour micro-environment 

Many human cancers contain cells that share some of the self-renewal 
and differentiation properties of stem cells. These cancer stem cells 
are thought to act as tumour-initiating cells that, after treatment, may 
potentially repopulate the therapy-resistant tumour. With this role in 
mind, understanding the response to DNA-damaging therapy in cancer 
stem cells may prove essential in understanding whether a treatment 
will ultimately work. Evidence is accumulating that the DDR has dis- 
tinct properties in cancer stem cells. These may include controversial 
and unproven concepts such as the ‘immortal strand’ model in which 
parental, but not newly synthesized DNA, is retained in stem cells after 
mitosis; retaining only the parental strand of DNA is thought to limit 
the impact of DNA replication errors to the stem cell population”. 
Whether this is the case or not, a number of studies have suggested 
that resistance to therapy is mediated by altered DDR activity in cancer 
stem cells and tumour-initiating cells. For example, a comparison of 
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Figure 2 | Genomic scars in cancer. Modern sequencing approaches mean 
that the nucleotide sequence and mutational spectrum of tumour biopsies 
can be rapidly determined. This spectrum, or sequence scar, could be used 
to classify each tumour. Classifiers could take into account the frequency 
and location of mutations, as well as their predominant sequence context 
(for example, insertions and deletions that are flanked by small regions of 
microhomology'*”). Such classifiers could then be used to estimate the 
probable outcome for the patient and to select the most appropriate therapy. 
The choice of therapy is defined by the balance of sequence scars found in 
tumour genome biopsies. 


breast tumour biopsies before and after chemotherapy treatment has 
shown an enrichment of cancer stem cells, which suggests an intrinsic 
resistance of these cells. In mouse models, cancer stem cells deficient 
in p53 showed accelerated DNA repair activity™, as well as high levels 
of AKT and WNT signalling, both of which promoted cancer-stem- 
cell survival after exposure to ionizing radiation®. In glioblastoma 
multiforme, cancer stem cells have been characterized on the basis of 
expression of prominin (CD133; now known as PROM1)™. CD133* 
cells are enriched after irradiation of human glioblastoma multiforme 
xenografts, suggesting that cancer stem cells are necessary for tumour 
regrowth after radiotherapy. Cancer stem cells showed more robust 
activation of the DDR and higher intrinsic DNA repair activity. Block- 
ing the kinases CHEK1 and CHEK2 led to enhanced killing of cancer 
stem cells by ionizing radiation™, suggesting that a pharmacological 
approach may overcome this intrinsic resistance. Much more detailed 
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and systematic knowledge of the consequences of DNA damage and 
the DDR in stem cells is required to exploit this important area. 

If we are to optimize therapies that exploit the DDR, then the impact 
of the tumour microenvironment and the non-tumour cells that sit 
within a neoplastic lesion must be considered. For example, non- 
tumour stromal cells form at least half of the cellular complement of 
pancreatic ductal adenocarcinomas. This stromal compartment con- 
tains pancreatic stellate cells, which, in in vitro systems at least, cause 
tumour cells to become resistant to ionizing radiation; this radiopro- 
tective effect could explain the relative radioresistance of pancreatic 
ductal adenocarcinomas. Already, there is a preliminary understanding 
of how stromal and tumour cells interact and communicate to protect 
against ionizing radiation, but the understanding of how the micro- 
environment modulates the DDR is still limited. Given the complexity 
of studying molecular mechanisms in vivo and of effectively model- 
ling cell-cell interactions in vitro, this is not surprising. These techni- 
cal problems are not insurmountable, but they do raise an important 
issue: historically, much of our knowledge of the DDR has originated 
from studies of cells in vitro. Understanding the DDR in vivo is critical 
if cancer therapies are to be modelled correctly. 


The genomic scar 

Historically, cancer taxonomy has mainly been based on the anatomical 
site of a tumour and its clinical and histopathological features. Cancers 
are now increasingly classified according to their underlying pattern of 
pathogenic mutations, a process which is being facilitated by exome or 
whole-genome sequencing of each cancer®. Treatment decisions can 
then be made on the basis of pharmacologically targetable drivers of 
the disease. In addition to identifying the driver genes affected, exam- 
ining the collateral genomic damage caused by DNA repair deficiency 
may help in the classification of cancer and could provide biomarkers 
to inform therapy. In essence, the mutational spectrum of a tumour, 
revealed by its DNA sequence, is a sequence ‘scar’ that reflects both 
the mutagens that a tumour has been exposed to and the repair pro- 
cesses that have operated to mitigate their impact”. Already, there are 
some precedents to suggest that this approach could have merit. For 
example, homologous-recombination deficient tumours have a char- 
acteristic mutational spectrum defined by the use of error-prone DSB 
repair mechanisms™”. Therefore, the wider tumour genome sequence 
or sequence scar in a tumour with defective homologous recombina- 
tion might be used to decide which patients should be treated with 
PARP inhibitors. Microsatellite instability is another example. In many 
cases, microsatellite instability in a tumour reveals both the mutagenic 
nature of DNA replication and the failure to reverse these sequence 
errors when mismatch repair is defective''. Given the impact that the 
DDR can have on both disease development and response to treatment, 
it seems reasonable to consider categorizing tumours according to their 
DNA repair defects and to use this information to personalize treat- 
ment (Fig. 2). Asa result of the speed at which in-depth, whole-genome 
tumour sequences are now being generated, we will soon have a first 
draft of the mutational landscape of all human cancers, which will allow 
these possibilities to be addressed. 

To match these molecular signatures of DNA mutation and repair, 
we require functional biomarkers that identify DDR defects and aid 
in the selection of therapy. Again, there are preliminary signs that this 
could be achievable: a mechanistic determinant of PARP-inhibitor 
sensitivity, RAD51 foci deficiency, can be measured by immunohis- 
tochemical staining in tumour biopsies”’, and the ability to detect 
other DNA repair complexes by immunohistochemistry is improv- 
ing”'. In principle, functional biomarkers should be identifiable for 
most DDR pathways even though significant technical and practical 
issues need to be resolved before such tests can be used in the clinic. 
Because the DDR pathway is fully engaged only after DNA is dam- 
aged, standard tumour biopsies or specimens are not particularly 
useful for measurement of DDR-pathway activity. A functional test 
of DDR-pathway deficiency might therefore need to include some 
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form of genotoxic stress, followed by a measurement of the molecular 
response. For example, to test the homologous recombination defect 
in tumour cells by measuring the RAD51 response, cells must first 
be exposed to an agent that causes DNA damage. These agents could 
include ionizing radiation, chemotherapy or the PARP inhibitor 
itself. Clinically, this could mean that patients need to be treated 
with a DNA-damaging agent before biopsy for the purposes of bio- 
marker assessment or alternatively tumour biopsies would have to 
be exposed to DNA-damaging agents ex vivo. Both scenarios have 
practical or logistical issues. For instance, tumour cells may not 
respond in the same way ex vivo as they do in vivo, and pretreating 
a patient to assess tumour response might not always be feasible 
from a clinical perspective. Nevertheless, molecular and functional 
assays could mean that a tumour could be defined as, for example, 
‘deficient for homologous recombination and potentially sensitive to 
PARP inhibitor’ before treatment rather than simply being classified 
as a ‘non-familial, triple-negative breast cancer. The development of 
functional biomarkers of other DDR pathways would considerably 
advance this area. 

The genomic instability of tumour cells is one of their most pervasive 
characteristics and presents a therapeutic opportunity. Already, DNA- 
damaging chemotherapies have provided the core of cancer treatment 
for the past half-century and, if used appropriately, targeted therapies 
that exploit the DDR could deliver better therapeutic responses in the 
future. Nevertheless, there are still significant unexplored territories for 
the field; an understanding of the DDR in vivo, the definition of clinical 
biomarkers that allow the selection of appropriate therapy and an under- 
standing of how the DDR interacts with the other homeostatic systems 
in the cell are relative unknowns. We believe that the crucial factors that 
have previously allowed this field to progress — the comprehensive study 
of the basic biology of the DDR combined with a desire to apply this 
information in the clinic — will drive significant advances in the future. = 
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The promise of induced pluripotent stem 
cells in research and therapy 


Daisy A. Robinton' * & George Q. Daley’ * 


The field of stem-cell biology has been catapulted forward by the startling development of reprogramming technology. The 
ability to restore pluripotency to somatic cells through the ectopic co-expression of reprogramming factors has created 
powerful new opportunities for modelling human diseases and offers hope for personalized regenerative cell therapies. 
While the field is racing ahead, some researchers are pausing to evaluate whether induced pluripotent stem cells are 
indeed the true equivalents of embryonic stem cells and whether subtle differences between these types of cell might 


affect their research applications and therapeutic potential. 


a flourishing research area, following the achievement of a long- 

standing ambition — the successful derivation of pluripotent 
stem cells from a patient’s cells. Ina momentous contribution, in 2006 
Takahashi and Yamanaka illustrated how cell fates can be altered by the 
ectopic co-expression of transcription factors’. The manipulation of cell 
fates through reprogramming has altered fundamental ideas about the 
stability of cellular identity, stimulating major new directions in research 
into human disease modelling, tissue differentiation in vitro and cellular 
transdifferentiation. Despite heady progress, a major question remains: 
are the new induced pluripotent stem (iPS) cells equivalent to the classic 
embryonic stem (ES) cells and thus a suitable alternative for research 
and therapy? Whereas the initial wave of papers argued convincingly 
that the two cell types were functionally equivalent, a more refined 
analysis of how iPS cells behave in vitro, coupled with genome-wide 
genetic and epigenetic analysis, has revealed numerous subtle but sub- 
stantial molecular differences, probably owing to technical limitations 
inherent in reprogramming. In this Review, we describe the derivation 
of iPS cells, outline the functional assessments of pluripotency, and then 
recount how global assessments of gene expression, gene copy num- 
ber variation, DNA methylation and chromatin modification provide 
a more nuanced comparison of iPS cells and ES cells. We detail how 
these features influence the utility of each of these cell types for disease 
modelling and therapeutics, and offer predictions for the evolution of 
the art of reprogramming somatic cells. 


A fter a decade of constraints, pluripotent stem-cell biology is now 


Pluripotent stem cells 

The years since Takahashi and Yamanaka’s breakthrough have seen a 
flood of papers touting advances in reprogramming technology, includ- 
ing alternative methods for reprogramming and the successful deriva- 
tion of iPS cells from various cell types. Although the field has advanced 
at a breathtaking pace, investigators have recently taken a step back to 
more critically evaluate iPS cells relative to ES cells and have endeav- 
oured to fully understand how these cell populations differ from one 
another in the hope of closing the gap between the two populations. 
Taking clues from the data, it seems that researchers should attempt 
to define each cell type more accurately and to understand its inherent 
properties rather than ask whether these two classes of pluripotent cell 


are identical. Although ES cells and iPS cells are arguably equivalent in 
all their functions, these cells are bound to harbour subtle differences 
and to have distinct but complementary roles in research because of 
their distinct origins and modes of derivation. To appreciate the differ- 
ences between ES cells and iPS cells, we must first define what it means 
to be pluripotent. 

The term pluripotency has been assigned to a variety of cell types 
with a wide range of functional capacities. In its loosest sense, pluripo- 
tent describes a cell that can generate cell types from each of the three 
embryonic germ layers: the endoderm, mesoderm and ectoderm. At 
the strict end of the range of definitions, however, pluripotent describes 
a cell that can give rise to an entire organism, generating every cell 
type within that organism. The property of cell pluripotency was first 
exposed by Driesch in 1891, when he separated the two cells of an 
early sea urchin blastocyst and observed the development of two com- 
plete sea urchins’. Many decades later, studies of embryo aggregation 
and blastocyst chimaerism by Mintz and colleagues’, Gardner* and 
Brinster’, in the 1960s and 1970s, solidified the idea that the cells of 
the inner cell mass of the mouse blastocyst were pluripotent, and the 
isolation of mouse teratocarcinoma stem cells and native ES cells by 
Evans and Kaufman* and Martin’, in 1981, ushered in the era of cul- 
turing pluripotent stem cells in a dish. The first successful isolation of 
human ES cells, by Thomson and colleagues in 1998, brought forth 
a surge of excitement in the scientific community and beyond*. The 
potential to understand early human development, tissue formation 
and differentiation in vitro through studying ES cells seemed to offer 
limitless possibilities. The opportunity to model diseases, discover 
disease mechanisms and, ultimately, use cell therapy for previously 
untreatable conditions was particularly alluring. 

The derivation of ES cells from the human embryo, however, 
sparked controversy in the United States and led to a presidential 
executive order that restricted government funding’. The limited 
numbers of stem cell lines that were approved for research lacked 
the diversity necessary to address some of the most compelling ques- 
tions, particularly those related to modelling and treating disease’. 
Most ES cells represented generic cells isolated from presumably 
normal embryos — except for those from embryos that had been 
tested by pre-implantation diagnostics and found to carry genetic 
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Figure 1 | Morphology of pluripotent stem cell types. Mouse ES (a) and 

iPS (b) cells form dome-shaped, refractile colonies. These colonies are in 
contrast to the flat morphology of mouse epiblast-derived stem cells (f), which 
resemble human ES (d) and iPS (e) cells. Human iPS cells induced into a naive 
pluripotent state by treatment with chemical inhibitors” (c) parallel the 
morphology of mouse ES and iPS cells. Scale bars, 50 um. 


diseases. The generic lines were not matched to a particular patient, 
so products derived from them for transplantation purposes would 
face rejection by the transplant recipient’s immune system or neces- 
sitate that the recipient receive lifelong therapy with toxic immu- 
nosuppressive medication. To compound these limitations, when 
human ES cells are cultivated on mouse feeder cells, the human cells 
can incorporate mouse components that render the ES cells subject 
to immune rejection. 

To realize the full potential of ES cells, researchers foresaw that 
customized, personalized pluripotent stem cells specific to each 
patient would be generated by using somatic-cell nuclear transfer 
(SCNT) — the procedure that had been used successfully to clone 
Dolly the sheep from adult mammary cells. Nuclear-transfer-gener- 
ated ES (ntES) cell lines would capture a patient’s complete genome 
in a cell that could be induced to become any tissue, thus allow- 
ing differentiation into disease-relevant cells for analysis or cell- 
replacement therapy. Despite successful proof of principle in mouse 
studies", and the clear distinctions between generating ntES cells 
for medical research and creating cloned blastocysts for reproduc- 
tion, the ethical controversy driven by widespread opposition to 
human cloning has severely curtailed research into human SCNT. 
Only this year, when investigators gained access to a large number of 
human oocytes, was the derivation of pluripotent stem cells through 
human SCNT accomplished”. However, the investigators in this 
study needed to leave the oocyte nucleus intact to derive pluripo- 
tent stem cells, so the resultant cells were triploid, thus affording 
research applications for these cells but limiting their suitability for 
therapeutic use”. 

Despite the many hindrances to the study and derivation of 
human ES and ntES cells over the past decade, great strides were 
being made in understanding the pathways that regulate the main- 
tenance and pluripotency of ES cells. This progress was not lost on 
those seeking an alternative source of personalized patient-specific 
stem cells, and in 2006 Takahashi and Yamanaka announced the suc- 
cessful derivation of iPS cells from adult mouse fibroblasts through 
the ectopic co-expression of only four genes’. In an elegant screen 
of 24 gene candidates selected for their links to ES-cell pluripo- 
tency, these researchers found four factors that were sufficient to 
reprogram adult fibroblasts into iPS cells: OCT4 (also known as 
POUSF1), SOX2, Kriippel-like factor 4 (KLF4) and c-MYC. This 
historic contribution inspired an astonishing flurry of follow-up 
studies, with successful reprogramming quickly translated to 
human fibroblasts’** and then to a wide variety of other cell types, 
including pancreatic 8 cells'®, neural stem cells'”"*, mature B cells’”, 
stomach and liver cells”, melanocytes”', adipose stem cells” and 
keratinocytes”, demonstrating the seemingly universal capacity to 
alter cellular identity. 
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Mouse and human iPS cells differ in appearance. Mouse iPS-cell 
colonies appear more dome-like and refractile than human iPS-cell 
colonies. Human iPS-cell colonies are flatter than those of mice and 
are akin to a distinct type of pluripotent stem cell that is derived from 
the epiblast of the early mouse embryo”, a feature that indicates that 
mouse and human iPS cells, like mouse and human ES cells, probably 
reflect distinct developmental states (Fig. 1). The pluripotent state 
of mouse stem cells is called a ‘naive’ state because it closely resem- 
bles the most primitive state, or ground state, of the mouse inner cell 
mass; this is different from the more ‘primed’ state of human stem 
cells, which proliferate in response to different cytokines, reflecting 
the distinct developmental states of these populations”. Regardless 
of the method of derivation, iPS cells maintain the key features of ES 
cells, including the ability to propagate in culture indefinitely and the 
capacity to generate cells from each of the three embryonic germ layers 
(see ref. 26 for a review). Such broad similarities are not proof that iPS 
cells are molecularly or functionally equivalent to ES cells. Yamanaka’s 
intention was to derive an alternative source of pluripotent stem cells 
with the same range of functions as ES cells but offering even greater 
potential for clinical use. To determine the degree of success garnered 
by reprogramming, we must explore the set of assays that were devel- 
oped to assess the key characteristic of ES cells: pluripotency. 


Assessment of pluripotency 

In the past few years, consistent standards for the identification and 
evaluation of iPS cells and for the assessment of their functional equiv- 
alence to ES cells have become widely accepted”. A variety of repro- 
gramming methods have been developed to derive iPS cells, and each 
has advantages and disadvantages (Table 1). Assessing reprogramming 
begins with identifying compact colonies that have distinct borders 
and well-defined edges, and are comprised of cells with a large nucleus, 
large nucleoli and scant cytoplasm. A wide range of colony morpholo- 
gies result from reprogramming, and although many colonies appear 
morphologically similar to ES cells, only a subset of these have com- 
parable molecular and functional features. To accurately distinguish 
reprogrammed, bona fide iPS cells from those that are only partially 
reprogrammed, investigators look for a series of molecular hallmarks. 


Markers of pluripotency 

Fully reprogrammed cells express a network of pluripotency genes, 
including OCT4, SOX2 and NANOG, in levels comparable to ES cells, 
and they reactivate telomerase gene expression, downregulate THY1 
and upregulate SSEA1 (ref. 28). Positive staining for alkaline phos- 
phatase activity has been widely used as a marker of pluripotency; how- 
ever, recently published data have shown this to be insufficient as a test 
for true iPS cells, because intermediately reprogrammed cells also stain 
positively”. The same report shows that iPS cells that are generated by 
virus-mediated reprogramming silence proviral genes when the endog- 
enous pluripotency genes are activated, and that this event is paired with 
the expression of the embryonic antigens SSEA3, TRA-1-60, TRA-1-81, 
DNA methyltransferase 38 (DNMT38) and REX] (ref. 29). Genome-wide 
epigenetic reprogramming is crucial for deriving fully reprogrammed 
cells, and the degree of success is measured, in part, by evaluating the 
methylation status at the promoters of the genes responsible for maintain- 
ing pluripotency, as well as at the genes important for driving differentia- 
tion”. A crucial event during epigenetic reprogramming is the reactiva- 
tion of the silent X chromosome, which occurs late in reprogramming 
and represents a hallmark of ground-state pluripotency*”™*". IfiPS cells 
acquire all of these molecular features, they are expected to behave like ES 
cells and to demonstrate reprogramming-factor independence, which is 
marked by silencing of the proviral transgenes. Variations in epigenetic 
reprogramming, the extent of methylation, the persistence of expression 
of integrated proviruses and other known and unknown factors can alter 
the differentiation potential of iPS cells. Because of the potential for het- 
erogeneity, it is essential to know as much as possible about the nature of 
acell line before labelling it pluripotent. 
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Table 1 | Methods for reprogramming somatic cells to iPS cells 
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Vector type Cell types Factors* Efficiency (%) Advantages Disadvantages 
Retroviral!!*8*83 Fibroblasts, neural stem OSKM, OSK, ~0.001-1 Reasonably efficient Genomic integration, 
cells, stomach cells, OSK + VPA, or incomplete proviral silencing 
iver cells, keratinocytes, OS + VPA and slow kinetics 
amniotic cells, blood cells 
and adipose cells 
Integrating Lentiviral’®1°848° Fibroblasts and OSKM or ~0.1-1.1 Reasonably efficient and Genomic integration and 
eratinocytes miR302/367 transduces dividing and non- __ incomplete proviral silencing 
cluster + VPA dividing cells 
Inducible Fibroblasts, B cells, OSKM or ~0.1-2 Reasonably efficient and allows Genomic integration and 
lentiviral???° eratinocytes, blood cells © OSKMN controlled expression of factors requirement for transactivator 
and melanocytes expression 
Transposon®° Fibroblasts OSKM “0.1 Reasonably efficient and no Labour-intensive screening of 
genomic integration excised lines 
Excisable loxP-flanked Fibroblasts OSK ~0.1-1 Reasonably efficient and no Labour-intensive screening of 
lentiviral®” genomic integration excised lines, and /oxP sites 
retained in the genome 
Adenoviral®&°? Fibroblasts and liver cells OSKM ~0.001 o genomic integration Low efficiency 
Non- 
integrating — Plasmid®°*! Fibroblasts OSNL ~0.001 Only occasional genomic Low efficiency and occasional 
integration vector genomic integration 
Sendai virus” Fibroblasts OSKM J] No genomic integration Sequence-sensitive RNA 
replicase, and difficulty in 
purging cells of replicating 
virus 
Protein?" Fibroblasts os ~0.001 No genomic integration, Low efficiency, short half-life, 
direct delivery of transcription and requirement for large 
factors and no DNA-related quantities of pure proteins and 
complications multiple applications of protein 
Modified mRNA® Fibroblasts OSKM or ~144 No genomic integration, Requirement for multiple 
DNA free OSKML + VPA bypasses innate rounds of transfection 
antiviral response, faster 
reprogramming kinetics, 
controllable and high efficiency 
MicroRNA? Adipose stromal cells and miR-200c, =O] Efficient, faster reprogramming Lower efficiency than other 
dermal fibroblasts miR-302s or kinetics than commonly used = commonly used methods 
miR-369s lentiviral or retroviral vectors, 


no exogenous transcription 
factors and no risk of 
integration 


*OSKM and similar factor names represent combinations of reprogramming factors: K, KLF4; L, LIN28; M, c-MYC; N, NANOG; O, OCT4; S, SOX2; and VPA, valproic acid. 


Functional assays of pluripotency 
When iPS cell lines are isolated and documented to carry the molecular 
features of fully reprogrammed cells, they are typically also assessed 
in functional assays. Characterization of the functional abilities of iPS 
cells begins with in vitro differentiation. The cells can be differenti- 
ated as embryoid bodies — compact balls of loosely organized tissues 
that resemble the gastrulating embryo — or through two-dimensional 
directed differentiation in a culture dish. Such cultures can then be 
assessed for markers of each of the three germ layers. Analysis of the 
pluripotency of mouse cells typically entails the development of a chi- 
maera, which evaluates the potential of iPS cells to contribute to the 
normal development of adult tissues after injection into the blastocyst. 
Whether germline transmission occurs after blastocyst chimaerism 
is measured by the ability of chimaeras to produce all-iPS-cell mice 
in their offspring. These offspring have the genomic integrity of the 
injected iPS cell line, as well as the ability to form functional germ cells. 
The highest stringency test for mouse iPS cells — tetraploid comple- 
mentation — entails the injection of iPS cells into tetraploid blastocysts 
to measure the ability of the iPS cells to direct the normal development 
of an entire organism. This test has been accomplished for only a limited 
subset of iPS cells”, although with an efficiency that parallels tetra- 
ploid complementation carried out with ES or ntES cells******, 

The current functional gold standard for human iPS cells involves the 
evaluation of teratoma formation. In this assay, the in vivo differentiation 


potential of human iPS cells is measured after their injection subcu- 
taneously or intramuscularly into immunodeficient mice*”™. If the 
cells are truly pluripotent, they will form well-differentiated tumours 
comprised of elements from each of the three germ layers. This assay 
provides information about the spontaneous differentiation potential 
of the injected iPS cells. Although it is the most stringent assay avail- 
able for human iPS cells, it is not powerful enough to assess whether 
iPS cells can produce all the cell types of the human body, and it cannot 
assess the contribution of iPS cells to the germ line. The caveat to all 
these functional assays lies in the fact that the standards for iPS cells 
are still hotly debated, especially when anticipating the use of iPS cells 
for therapy”. Adopting a consistent set of standards that can be applied 
uniformly worldwide is essential as stem-cell research and applications 
move forward. 


Functional differences between iPS cells and ES cells 

Despite the multitude of assays used to evaluate pluripotency, and 
although many parallels have been found between iPS cells and ES 
cells, there is a wide range of evidence showing that there are subtle 
yet substantial differences between these cell types. Disparities were 
first observed in the differentiation abilities of iPS cells and ES cells 
in both teratoma-forming and in vitro differentiation assays. Some 
mouse iPS cells showed lower efficiencies of teratoma formation than 
mouse ES cells, whereas some human iPS cells showed less propensity 
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to differentiate along haematopoietic, neuroepithelial and neuronal 
lineages than human ES cells**”. Some researchers interpreted these 
findings to mean that iPS cells have an intrinsically lower differentia- 
tion capacity than ES cells’, whereas other research groups have offered 
different explanations, including that the cell of origin might have a 
specific effect on the differentiation capacities of the derived iPS cells. 

The results from cell-of-origin studies indicate that the parental cell 
can influence the differentiation capacity of the resultant iPS cells. In one 
study, mouse bone-marrow-derived and B-cell-derived iPS cells showed 
more efficient differentiation along haematopoietic lineages than did 
fibroblast-derived iPS cells or neural-progenitor-derived iPS cell lines. 
Interestingly, treatment of the neural-progenitor-derived iPS cells with 
trichostatin A, a potent histone-deacetylase inhibitor, plus 5-azacyti- 
dine, a methylation-resistant cytosine analogue, increased the blood- 
forming capacity of these cells, suggesting that their limitations were 
due to epigenetic modifications. Whereas the bone-marrow-derived 
and neural-progenitor-derived iPS cells contributed well to all tissues 
in the chimaera assay, including to the germ line, the fibroblast-derived 
iPS cells contributed only poorly”. This study laid some of the early 
groundwork for later lines of investigation that probed the molecular 
differences between iPS cells and ES cells, and provided an explanation 
for the functional differences between these cells. 

One investigation found that some iPS cells derived from human 
retinal-pigment epithelial cells show an increased propensity to differ- 
entiate back into this cell type than do ES cells or iPS cells derived from 
other tissues“. More recently, Bar-Nur and colleagues showed that iPS 
cells generated from human pancreatic islet B cells retain open chroma- 
tin at the loci of key B-cell genes and that this correlates with a greater 
capacity to differentiate into insulin-producing cells both in vivo and 
in vitro than that of ES cells or isogenic non-B-cell-derived iPS cells”. 
These functional differences extend beyond differentiation and potency 
to disease modelling. For example, fragile X syndrome is caused by aber- 
rant silencing of the FMR1 gene during human development; iPS cells 
that were reprogrammed from adult skin fibroblasts from an individual 
with fragile X syndrome failed to reactivate the FMRI gene, whereas 
ES cells derived from embryos with this syndrome, as diagnosed by 
pre-implantation testing, expressed FMR1 (ref. 46). Consequently, the 
potential for epigenetic memory in the fragile-X-syndrome-derived iPS 
cells, and substantial differences between fragile-X-syndrome-derived 
iPS and ES cells, must be considered when studying this condition and 
potentially many other conditions. To determine whether the pluripo- 
tent cells being used are appropriate to address a particular question or 
to use in a given application, it is crucial to compare not only the in vivo 
and in vitro differentiation potentials but also the genetic and epigenetic 
disparities that underpin these functional differences. 


iPS cells versus ES cells 

Refined analyses, described in this section, have addressed whether 
iPS cells are suitable alternatives to classic ES cells for use in research 
and therapy. 


Genetics and epigenetics 

Global gene-expression analysis and bisulphite genomic sequencing 
accompanying early derivations of iPS cells provided the initial evidence 
for differences between iPS cells and ES cells at the epigenetic level”. 
Further exploration of these differences led to the identification of only 
a few, seemingly consistent, differences in global gene expression that 
were more pronounced in earlier passages of iPS cells*’. Many of the 
differentially expressed genes were imprinted in ES cells®. 

Looking beyond the expression patterns to the DNA sequence itself 
has revealed genetic variation between iPS cells and ES cells. A recent 
publication suggested that chromosomal aberrations are a common 
feature of stem-cell populations that are propagated in vitro, with each 
type of stem cell — whether ES cell, iPS cell or multipotent stem cell — 
being prone to distinct abnormalities”. Both human iPS and ES cells 
showed a tendency for gains at chromosomes 12 and 17. Whereas iPS 
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cells had additional gains at chromosomes 1 and 9, ES cells had additional 
gains at chromosomes 3 and 20. Other work identified an accumula- 
tion of point mutations in reprogrammed cells, particularly occurring 
in oncogenic pathways”, whereas another study noted an increase in 
copy number variants (CNVs) in early-passage human iPS cells relative 
to intermediate-passage iPS cells or ES cells”. The number and size of 
these CNVs were negatively correlated with the passage number in iPS 
cells, suggesting that a selective disadvantage is conferred by these aber- 
rations. A comparison of iPS cells and their parental cell of origin showed 
that the majority of CNVs were created de novo in fragile regions of the 
genome’. A comprehensive study by Laurent and colleagues found a 
higher frequency of subchromosomal CNVs in pluripotent cell samples 
than in non-pluripotent cell samples”. This work uncovered variation 
between genomic regions enriched for CNVs in human ES cells and iPS 
cells. A small subset of samples of ES cells harboured a large number 
of duplications, whereas several iPS-cell samples contained moderate 
numbers of deletions. Reprogramming was associated with deletions 
in tumour-suppressor genes, whereas extended time in culture led to 
duplications of oncogenes in human iPS cells. 

The finding that human iPS cells derived from a variety of tissues have 
residual, persistent donor-cell-specific gene-expression patterns sparks 
the question of whether the current measure ofa fully reprogrammed 
cell is sufficient” or whether iPS cells retain some type of ‘somatic mem- 
ory from their past identity. To understand this observation better, a 
more detailed analysis at the epigenetic level is required. 

Reprogramming cells to a pluripotent state entails global epigenetic 
remodelling and introduces epigenetic changes, some of which are 
necessary for reprogramming to occur and others of which are inad- 
vertently introduced during the process. A failure to demethylate pluri- 
potency genes is associated with partial reprogramming in iPS cells. 
Whole-genome profiling of the DNA methylomes of multiple human 
iPS and ES cell lines, as well as somatic and progenitor cell lines, from 
different laboratories using different reprogramming techniques and 
with a variety of cells derived from distinct germ layers has shown that 
although overall iPS-cell DNA methylomes closely resemble human 
ES-cell DNA methylomes, iPS cells have significant variability in their 
somatic memory, as well as aberrant iPS-cell-specific differential meth- 
ylation. Some studies have suggested that this occurs in a passage- 
dependent manner, but others have shown that differentially methylated 
regions (DMRs) in iPS cells are transmitted to differentiated progeny at 
ahigh frequency and cannot be erased through passaging”’**>*. Overall, 
there are remarkable global similarities between the DNA methylomes 
of generic iPS and ES cells; however, a core set of DMRs that seems to 
represent hot spots of failed epigenetic reprogramming has been identi- 
fied’’. These DMRs are enriched for genes that are important for devel- 
opmental processes”””*. The high incidence of unique DMRs in iPS cells 
compared with progenitor somatic cells or ES cells suggests that these 
patterns are stochastic and arise during reprogramming. In the most 
exhaustive comparison so far, Kim and colleagues reported that more 
DMRs were present in mouse iPS cells than in ntES or embryo-derived 
ES cells. However, these DMRs did not pertain to specific loci and thus 
do not represent consistent differences between iPS cells and ES cells. 
This lack of consistency suggests that aberrant DMRs in mouse iPS cells 
reflect the technical limitations inherent in reprogramming, rather than 
indicating loci that can reliably distinguish ES cells from iPS cells”. 

In addition, the residual iPS-cell-specific methylation in many iPS-cell 
isolates links these cells to their tissue of origin and, ultimately, affects 
their differentiation propensity**. Residual signatures can be distinct 
enough to enable the myeloid and lymphoid origins of blood-derived 
iPS cells to be discerned”. In iPS cells derived from non-haematopoietic 
cells, such as fibroblasts and neural progenitors, there can be residual 
repressive methylation at loci that are required for haematopoietic fates, 
reducing the blood-forming potential in vitro”. Exogenous supplementa- 
tion of neural-progenitor-derived iPS cells with the cytokine WNT3A 
can increase the blood-forming potential of these cells, supporting the 
idea that incomplete reprogramming owing to epigenetic marks can be 
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overcome by manipulating the culture conditions. Treatment of cultures 
with demethylating agents or knockdown of DNMT1 expression has 
been shown to convert intermediately reprogrammed cells into fully 
pluripotent cells, further supporting this idea**. When iPS cells are 
forced to differentiate along a particular lineage, they become more 
amenable to generating cells of that lineage after another round of repro- 
gramming”. This finding shows that the differentiation propensity and 
DNA methylation profile can be reset, and it suggests that the ‘epigenetic 
memory of the donor cell can be exploited, especially in cases in which 
directed differentiation is particularly challenging”. 

Another important feature of epigenetic reprogramming is the 
reactivation of the inactivated X chromosome. During normal devel- 
opment in eutherian mammals (those with a placenta), one X chro- 
mosome is randomly inactivated in each cell in females. Whether this 
epigenetic silencing eventis reset in iPS cells remains an area of contro- 
versy, in part because of the poor fidelity of X-inactivation markers in 
pluripotent cells”. Some studies have shown that the majority of female 
human iPS clones retain an inactivated X chromosome (which is tran- 
scriptionally silent)*’”, whereas others have indicated that some human 
iPS clones lose immunostaining for trimethylated H3K27 on the X 
chromosome (a marker of epigenetic silencing), indicating X reacti- 
vation®’. In addition, some of the earliest studies of iPS cells showed 
X reactivation in reprogrammed female mouse fibroblasts’’”*. How- 
ever, recently published data support the finding that X reactivation 
does not occur in human iPS cells and, interestingly, reprogramming 
was found to favour expression of a particular X chromosome when 
induced from a mixed X-inactivated population of fibroblasts”. Finally, 
epigenetic reprogramming sometimes fails to properly restore bivalent 
domains, which mark developmental loci with active and repressive 
histone modifications®. 

Although many of the studies cited here have generated data 
suggesting that there are epigenetic differences between iPS cells 
and ES cells, there are several limitations on extending these data to 
all iPS cells in a more general (and more useful) sense. The published 
comparisons were often made using iPS cells derived from a multi- 
tude of laboratories by a variety of methodologies, and reanalysis of 
the gene-expression microarray data using an unsupervised clustering 
algorithm shows a strong correlation between transcriptional signatures 
and specific laboratories for both iPS cells and ES cells. This finding 
indicates that specific culture protocols and laboratory environments 
can affect the transcriptional profile of iPS and ES cells. Therefore, the 
data produced in a particular laboratory might be specific to the cells 
derived there. 

In addition, most iPS colonies are clones derived from a single 
reprogrammed cell, whereas ES cells used for analysis are typically non- 
clonal. The subcloning of ES cells has revealed genetic and epigenetic 
anomalies that would probably have otherwise gone undetected in the 
heterogeneous ES-cell population”. With regard to somatic memory, 
there is poor overlap between the gene sets that have been reported to 
be characteristic of a particular cell type of origin, suggesting that the 
retention of somatic memory is stochastic and is a reflection of the tech- 
nical failure of reprogramming to fully erase the somatic epigenome. 
To exacerbate the issue, the iPS cells used for comparison often have 
different genetic backgrounds and have frequently been derived from 
fibroblasts that were already heterogeneous in their make-up, affecting 
both the gene-expression patterns and the functionality of the iPS cells. 

Throughout the literature, many publications lack correlation 
between the gene-expression patterns and the epigenetic patterns 
observed. An additional consideration is the presence of different 
viral insertions in individual iPS cell lines, which can also affect the 
functionality of the derived cells’. Evidence to support this idea is 
provided by the reduced number of differences observed among iPS 
cells and between iPS and ES cells when transgenes are removed”. 
Many of the aforementioned studies have focused on differences in 
either transcriptional profiles or changes in epigenetic marks; how- 
ever, the most recent studies have evaluated iPS cells and ES cells from 
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both of these angles in parallel, together with their in vitro differentia- 
tion potential, generating the most comprehensive and compelling 
data that have been published so far. 


Holistic analysis 

Stadtfeld and colleagues explored the epigenetic and functional 
discrepancies between iPS cells and ES cells using a new reprogram- 
ming strategy that allowed direct comparison of genetically matched 
cells derived from the same source™. These authors derived iPS and ES 
cells from mice carrying an integrated doxycycline-inducible reprogram- 
ming cassette in every cell, a strategy that sidesteps the confounding 
effects of variable genetic backgrounds and viral integration that have 
been observed in other studies. The overall messenger RNA and micro- 
RNA expression patterns of iPS cells and ES cells were indistinguishable 
except for the aberrant silencing of a few transcripts localized to the 
imprinted Dik1-Dio3 gene cluster on chromosome 12qF1, a region that 
is important for development. A failure to reactivate this locus meant that 
iPS cells contributed poorly to chimaeras and were unable to generate 
all-iPS-cell mice. By contrast, iPS cells with normal Dik1-Dio3 expres- 
sion contributed to high-grade chimaeras and supported the develop- 
ment of viable all-iPS-cell mice. The treatment of iPS cells that failed to 
reactivate DlkI1-Dio3 with a histone-deacetylase inhibitor rescued the 
ability of these clones to support the development of all-iPS-cell mice 
by relieving this region of aberrant hypermethylation. However, recent 
data from iPS and ES cells derived from a mouse strain carrying a dis- 
tinct reprogramming cassette suggest that different levels of expression 
of reprogramming factors, rather than aberrant silencing of Dik1-Dio3, 
account for the different behaviour of the cell lines™. The disparate results 
from these studies highlight that iPS cells can behave differently based 
on subtle variations in the expression of only a few loci. 

To systematically compare human iPS cells derived from different 
somatic cell types and ES cells, Ohi and colleagues compared ES cells 
with iPS cells reprogrammed from somatic cells representative of the 
three embryonic germ layers®. Transcriptional and epigenetic profil- 
ing of these cells showed transcriptional differences, owing, in part, to 
incomplete promoter methylation, which enabled iPS cells to be dis- 
cerned on the basis of their cell of origin. The differential methylation 
between iPS cells and ES cells did not correlate with varying levels in 
DNA methyltransferases; however, the authors found a nonrandom pat- 
tern of incompletely silenced genes in genetic regions that are isolated 
from other genes that undergo silencing during reprogramming. This 
finding could be explained by inefficient or delayed recruitment of the 
silencing machinery and DNA methyltransferases to particular somatic 
genes because of the isolated location of these genes”. 

In another comprehensive study, Polo and colleagues evaluated the 
effect of cellular origin on the gene-expression pattern, epigenetic prop- 
erties and functional abilities of genetically matched mouse iPS cells”. 
Using the same ‘secondary’ reprogramming strategy used by Wernig 
and colleagues”, whereby reprogramming is assessed using tissues 
from a mouse generated from iPS cells carrying integrated, doxycy- 
cline-inducible reprogramming factors, Polo and colleagues generated 
iPS cells from tail-tip fibroblasts, splenic B cells, bone-marrow-derived 
granulocytes and skeletal muscle precursors, and showed that each iPS 
cell line retained a transcriptional memory of its cell of origin. This 
memory was evident in that markers for each respective cell of origin 
remained actively expressed, and was supported by the finding that 
iPS cell lines derived from the same cell of origin clustered together 
on the basis of global transcriptional data. A similar correlation was 
found on evaluation of methylation patterns, which showed subtle but 
substantial differences, reflecting the consequences of different histone 
marks. The effects of somatic memory extended beyond the genetic 
and epigenetic levels to functional significance, affecting the autono- 
mous differentiation potential of the different iPS cell lines after embry- 
oid-body formation. A clear bias that reflected the cells of origin was 
observed in the iPS cell lines. Notably, the transcriptional, epigenetic 
and functional effects evaluated in early-passage iPS cell lines became 


19 JANUARY 2012 | VOL 481 | NATURE | 299 


© 2012 Macmillan Publishers Limited. All rights reserved 


REVIEW 


less significant with continued passaging. This finding indicated that 
complete reprogramming is a gradual process that extends beyond the 
time frame necessary to observe the activation of endogenous pluripo- 
tency genes, transgene-free growth and differentiation into cell types 
from each of the three germ layers. 


Exploring equivalence 

Having considered data from a multitude of published studies, generated 
by the painstaking efforts of many research groups, we return to our 
earlier question: are iPS cells equivalent to ES cells? The answer is not 
straightforward. Rather, there is an emerging consensus that iPS cells 
and ES cells are neither identical nor distinct populations. Instead, they 
are overlapping, with greater variability inherent within each population 
than between the populations. The heterogeneity and behaviour of each 
class of cells is more complex than has previously been thought. The 
two pluripotent stem-cell types are, in theory, functionally equivalent; 
however, in practice, they harbour genetic and epigenetic differences 
that reflect their different histories. It remains to be seen whether there 
are any consistent molecular distinctions between iPS cells and ES cells. 

It is also important to consider that, in contrast to long-standing 
belief, ES cells themselves have considerable epigenetic heterogeneity 
and have differing propensities for differentiation — much like those 
found in iPS cells°”**. These observations, paired with those discussed 
in this Review, are a call for researchers to take a step back from the 
direct comparison of iPS cells and ES cells, and they highlight the need 
to redefine what it means to belong to either of these cell classes. Some 
researchers have already taken heed of this message and generated a 
bioinformatics assay for pluripotency®, whereas others have pro- 
duced a ‘scorecard’ to evaluate the character of both iPS and ES cell 
lines and predict the quality and utility of any pluripotent cell line in a 
high-throughput manner”. Using DNA methylation mapping, gene- 
expression profiling and a quantitative differentiation assay, Bock and 
colleagues made a systematic comparison of 20 established ES cell lines 
and 12 iPS cell lines”. They confirmed that, despite overall similarities, 
transcriptional and epigenetic variation is common between iPS cell 
lines, between ES cell lines, and between iPS cell and ES cell populations. 
These data provide a reference for the variation among human pluripo- 
tent cell lines, which assists in predicting the functional consequences 
of these differences. We can conclude from these studies that any given 
iPS cell line generated by today’s technology might not be completely 
equivalent to the ideal ES cell. 

The differences between iPS cells and ES cells, as well as those among 
iPS cells, clearly affect the utility of these cells in research, disease model- 
ling and therapeutics, providing an impetus for investigators to evaluate 
their cell populations carefully and precisely. The differences do not 
diminish the potential of iPS cells, given that iPS cells have considerable 
advantages over ES cells. Rather than replacing ES cells with iPS cells, 
it is becoming clear that these two cell types complement one another. 
Researchers are still in the process of developing the necessary protocols 
to harness the potential of iPS cells; however, as it becomes clear how 
to evaluate the genetic, epigenetic and functional status of different iPS 
cell lines, further applications of these cells will be uncovered, and pro- 
gress will be made in creating iPS cell lines and designing protocols to 
accomplish the ambitious goals of the field. 


Medical applications of iPS cells 

Generating patient-specific stem cells has been a long-standing goal 
in the field of regenerative medicine. Despite considerable challenges, 
generating disease-specific and patient-specific iPS cells through 
reprogramming has become almost routine. These cells provide a 
unique platform from which to gain mechanistic insight into a variety 
of diseases, to carry out in vitro drug screening, to evaluate potential 
therapeutics and to explore gene repair coupled with cell-replace- 
ment therapy (Fig. 2). In the past few years, the number of reports 
on applications of iPS cells has steadily increased, testifying to the 
broad influence of this breakthrough technology (Table 2). Despite 
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the continued presence of substantial hurdles, the pace of this work 
is such that no review can capture the current state of the field; thus, 
we point to a few publications that highlight the promising medical 
applications of iPS cells but also indicate their key limitations. 

In 2009, Lee and colleagues harnessed iPS cells to demonstrate disease 
modelling and drug screening for familial dysautonomia, a rare genetic 
disorder of the peripheral nervous system”. In almost all cases, familial 
dysautonomia is caused by a single point mutation in the gene encoding 
the inhibitor of nuclear factor-«B (IkB)-kinase-complex-associated pro- 
tein (IKBKAP) that manifests as an extensive autonomic nervous system 
deficit and dysfunction in small-fibre sensory neurons. Although many 
traditional cell-based models have been used to study the pathogenesis 
of familial dysautonomia and to screen for candidate drugs, none has 
used symptom-relevant human cell types. With the successful derivation 
of iPS cells from patients with familial dysautonomia, investigators pro- 
duced central and peripheral nervous system precursors and subsequently 
found three disease-related phenotypes, thus providing validation that 
disease-relevant cell types could accurately reflect disease pathogenesis in 
vitro”. After screening with multiple compounds, the authors showed that 
the disease phenotype could be partially normalized by kinetin, a plant 
hormone. This initial report demonstrated how iPS cells can facilitate 
the discovery of therapeutic compounds and described how these cells 
provided a platform for modelling different severities of familial dysau- 
tonomia and for generating predictive tests to determine differences in 
the clinical manifestation of the disorder. 

Such applications of iPS cells in drug screening and discovery are 
destined to expand to encompass numerous disease conditions. Sev- 
eral research groups have generated models of long QT syndrome, 
a congenital disease with 12 types, each of which is associated with 
abnormal ion-channel function, a prolonged QT interval on an 
electrocardiogram and a high risk of sudden cardiac death due to 
ventricular tachyarrhythmia. Much work has been carried out in ani- 
mal models to probe the underlying mechanisms of this syndrome, 
but cardiomyocytes have distinct and complex electrophysiological 
properties that differ between species. In addition, the lack of in vitro 
sources of human cardiomyocytes and the inability to model patient- 
specific variations of this disease has impeded studies. 

In a proof-of-principle study for using iPS cells to capture the 
physiological mechanisms of genetic variation, Moretti and colleagues 
differentiated iPS cells from individuals with type 1 long QT syndrome 
into cardiomyocytes and, as predicted, observed prolonged action poten- 
tials in the ventricular and atrial cells”. Using this model system, these 
investigators uncovered a dominant-negative trafficking defect associ- 
ated with the particular mutation that causes this variant of long QT 
syndrome. Further investigation of long QT syndrome iPS-cell-derived 
cardiomyocytes showed that these cells had an increased susceptibil- 
ity to catecholamine-induced tachyarrhythmia, and compounds that 
exacerbated the condition (including isoprenaline) were identified”. 
Treatment of these cardiomyocytes with B-adrenergic receptor blockers 
attenuated the long QT phenotype. 

Type 2 long QT syndrome has also been modelled in cardiomyocytes, 
by Itzhaki and colleagues”. The authors derived type 2 long QT syndrome 
iPS cells to evaluate the potency of existing and new pharmacological 
agents that might exacerbate or ameliorate the condition. Their studies 
show that the long QT syndrome phenotype was aggravated by blockers 
of ERG-type potassium channels, whereas nifedipine, a calcium-channel 
blocker, and pinacidil, an agonist of ATP-sensitive potassium channels, 
both ameliorated the long QT syndrome phenotype, as shown by the 
decreased duration of action potentials in long QT syndrome cardio- 
myocytes, as well as the elimination of early after-depolarizations and the 
abolishment of all triggered arrhythmias. A possible limitation of these 
beneficial drugs is excessive shortening of the action-potential duration, 
leading to short QT syndrome. 

Importantly, these studies established that the iPS-cell model can be 
used to identify complex cardiotoxic effects of drugs, as well as to define 
protective pharmacological agents, including optimal drug dosages. 
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Given the number of drugs that have notoriously been withdrawn from 
the market because of their tendency to induce arrhythmias, it is highly 
likely that the current inadequate approaches for assessing cardiotoxic- 
ity will be complemented by iPS-cell-based assessments of drug effects. 

A study from our laboratory explored dyskeratosis congenita, a dis- 
order of telomere maintenance, and provided an unanticipated insight 
into the basic biology of telomerase that has therapeutic implications”. 
In its most severe form, dyskeratosis congenita is caused by a mutation 
in the dyskerin gene (DKC1), which is X linked, leading to shortened tel- 
omeres and premature senescence in cells and ultimately manifesting as 
the degeneration of multiple tissues. Because the reprogramming of cells 
to an induced pluripotent state is accompanied by the induction of the 
gene encoding telomerase reverse transcriptase (TERT), we investigated 
whether the telomerase defect would limit the derivation and mainte- 
nance of iPS cells from individuals with dyskeratosis congenita. Although 
the efficiency of iPS-cell derivation was poor, we were able to successfully 
reprogram patient fibroblasts. Surprisingly, whereas the mean telomere 
length immediately after reprogramming was shorter than that of the 
parental fibroblast population, continued passage of some iPS cell lines 
led to telomere elongation over time. This process was accompanied by 
upregulation of the expression of TERC, which encodes the RNA subunit 
of telomerase. 

Further analysis established that TERT and TERC, as well as DKC1, 
were expressed at higher levels in dyskeratosis-congenita-derived iPS cells 
than in the parental fibroblasts”. We determined that the genes encoding 
these components of the telomerase pathway — including a cis element 
in the 3’ region of the TERC locus that is essential for a transcriptionally 
active chromatin structure — were direct binding targets of the pluri- 
potency-associated transcription factors. Further analysis indicated that 
transcriptional silencing owing toa 3’ deletion in the TERC locus leads to 
the autosomal dominant form of dyskeratosis congenita by diminishing 
TERC transcription. Although telomere length is restored in dyskeratosis- 
congenita-derived iPS cells, differentiation into somatic cells is accompa- 
nied by a return to pathogenesis with low TERC expression and a decay in 
telomere length. This finding showed that TERC RNA levels are dynami- 
cally regulated and that the pluripotent state of the cells is reversible, sug- 
gesting that drugs that elevate or stabilize TERC expression might rescue 
defective telomerase activity and provide a therapeutic benefit. Although 
we set out to understand the pathogenesis of dyskeratosis congenita with 
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Figure 2 | Medical applications of iPS cells. 
Reprogramming technology and iPS cells have 
the potential to be used to model and treat 
human disease. In this example, the patient has 

a neurodegenerative disorder. Patient-specific 
iPS cells — in this case derived by ectopic 
co-expression of transcription factors in cells 
isolated from a skin biopsy — can be used in one 
of two pathways. In cases in which the disease- 
causing mutation is known (for example, familial 
Parkinson's disease), gene targeting could be 
used to repair the DNA sequence (right). The 
gene-corrected patient-specific iPS cells would 
then undergo directed differentiation into the 
affected neuronal subtype (for example, midbrain 
dopaminergic neurons) and be transplanted into 
the patient's brain (to engraft the nigrostriatal 
axis). Alternatively, directed differentiation of 
the patient-specific iPS cells into the affected 
neuronal subtype (left) will allow the patient's 
disease to be modelled in vitro, and potential drugs 
can be screened, aiding in the discovery of novel 
therapeutic compounds. 
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this study, we showed that a high expression level of multiple telomerase 
components was characteristic of the pluripotent state more generally, 
illustrating how iPS cells can reveal fundamental aspects of cell biology. 

Anindependent study of the reprogramming of cells from patients with 
dyskeratosis congenita confirmed the general transcriptional upregula- 
tion of multiple telomerase components and the maintenance of telomere 
lengths in clones’*; however, in this study, no clones with elongated telom- 
eres were identified. The different outcomes of these studies highlight the 
limitations of iPS-cell-based disease models that are imposed by clonal 
variation as a result of the inherent technical infidelity of reprogram- 
ming”. This point also introduces an additional important consideration. 
Before a given iPS-cell disease model can be claimed to be truly represent- 
ative of the disease, how many patients must be involved, and how many 
iPS cell lines must be derived from each patient? Although the answers to 
these questions are unclear, it is crucial to keep these issues in mind when 
generating disease models and making claims based on these models. 

Although iPS cells are an invaluable tool for modelling diseases in vitro, 
the goal of developing patient-specific stem cells has also been motivated 
by the prospect of generating a ready supply of immune-compatible cells 
and tissues for autologous transplantation. At present, the clinical trans- 
lation of iPS-cell-based cell therapies seems more futuristic than the in 
vitro use of iPS cells for research and drug development, but two ground- 
breaking studies have provided the proof of principle in mouse models 
that the dream might one day be realized. Hanna, Jaenisch and colleagues 
used homologous recombination to repair the genetic defect in iPS cells 
derived from a humanized mouse model of sickle-cell anaemia”’. Directed 
differentiation of the repaired iPS cells into haematopoietic progenitors 
followed by transplantation of these cells into the affected mice led to 
the rescue of the disease phenotype. The gene-corrected iPS-cell-derived 
haematopoietic progenitors showed stable engraftment and correction of 
the disease phenotype. 

In another landmark study from Jaenisch’s research group, Wernig 
and colleagues derived dopaminergic neurons from iPS cells that, when 
implanted into the brain, became functionally integrated and improved 
the condition of a rat model of Parkinson's disease”. The successful 
implantation and functional recovery in this model is evidence of the 
therapeutic value of pluripotent stem cells for cell-replacement therapy 
in the brain — one of the most promising areas for the future of iPS- 
cell applications. 
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Table 2 | Diseases modelled with iPS cells 


Disease Molecular defect of donor cell Cell type differentiated from iPS cells Disease phenocopied Drug or 
in differentiated cells — functional tests 
Neurological 
Amyotrophic lateral sclerosis (ALS) Heterozygous Leul44Phe mutation Motor neurons and glial cells ND No 
in SOD1 
Spinal muscular atrophy (SMA) Mutations in SMN1 Neurons and astrocytes, and mature motor Yes Yes 
neurons 
Parkinson’s disease Multifactorial; mutations in LRRK2 Dopaminergic neurons No Yes 
and/or SNCA 
Huntington's disease 72 CAG repeats in the huntingtin gene None NA No 
Down’s syndrome Trisomy 21 Teratoma with tissue from each of the three Yes No 
germ layers 
Fragile X syndrome CGG triplet repeat expansion resulting None NA No 
in the silencing of FMR1 
Familial dysautonomia Mutation in IKBKAP Central nervous-system lineage, peripheral Yes Yes 
neurons, haematopoietic cells, endothelial cells 
and endodermal cells 
Rett’s syndrome Heterozygous mutation in MECP2 Neural progenitor cells Yes Yes 
Mucopolysaccharidosis type IIIB Homozygous mutation in NAGLU Neural stem cells and differentiated neurons Partially Yes 
(MPS IIIB) 
Schizophrenia Complex trait Neurons Yes Yes 
X-linked adrenoleukodystrophy Mutation in ABCD1 Oligodendrocytes and neurons Partially Yes 
(X-ALD), childhood 
cerebral ALD (CCALD) and 
adrenomyeloneuropathy (AMN) 
Haematological 
ADA SCID Mutation or deletion in ADA None D No 
Fanconi’s anaemia FAA and FAD2 corrected Haematopoietic cells o (corrected) No 
Schwachman-Bodian-Diamond Multifactorial None A No 
syndrome 
Sickle-cell anaemia Homozygous HbS mutation None A No 
6-Thalassaemia Homozygous deletion in the B-globin Haematopoietic cells No 
gene 
Polycythaemia vera Heterozygous Val617Phe mutation Haematopoietic progenitors (CD34*CD35") Partially No 
in JAK2 
Primary myelofibrosis Heterozygous mutation in JAK2 None A No 
Metabolic 
Lesch-Nyhan syndrome (carrier) Heterozygous mutation in HPRT1 None A No 
Type 1 diabetes Multifactorial; unknown B-Cell-like cells (express somatostatin, No 
glucagon and insulin; glucose-responsive) 
Gaucher’s disease, type III Mutation in GBA None A No 
al-Antitrypsin deficiency (AIATD) | Homozygous mutation in the Hepatocyte-like cells (fetal) Yes No 


al-antitrypsin gene 


Together, these findings provide proof of principle for using 
reprogramming with gene repair and cell-replacement therapy for 
treating diseases. Using iPS cells in cell-replacement therapy offers 
the promise of therapeutic intervention that is not compounded 
by the use of immunosuppressive drugs to prevent tissue rejection, 
while harnessing targeted gene-repair strategies, such as homolo- 
gous recombination and zinc-finger nucleases, to repair genetic 
defects. These strategies provide the opportunity for generating an 
unlimited population of stem cells that can be differentiated into 
the desired cell type for studying disease mechanisms, for screening 
and developing drugs or for developing a suitable cell-replacement 
therapy. There have been considerable advances and successes to this 
end; however, selecting an appropriate disease target, directing the 
differentiation of iPS cells into phenotype-relevant cell populations 
and identifying disease-relevant phenotypes remain major hurdles. 
It is unclear whether iPS cells used for cell-replacement therapy 
would completely evade an immune response when returned to the 
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patient, because a recent study has shown the immune rejection of 
teratomas formed from iPS cells, even in syngeneic mice”*. Neverthe- 
less, iPS cells provide a promising model with which to study disease 
mechanisms, discover new therapies and develop truly personalized 
treatments. 


Predictions for the evolution of the art 

Few fields have enjoyed the remarkable upsurge in activity and 
excitement that followed the initial report of the reprogramming of 
somatic cells into iPS cells in 2006. Despite heady progress, crucial 
challenges must be met for the field to realize its full potential. There 
is as yet no consensus on the most consistent or optimal protocol 
for deriving the most reliable and, ultimately, the safest iPS cells. 
Increasing the reprogramming efficiency and effecting reprogram- 
ming without genetically modifying the cells are goals that have been 
achieved. Using more-uniform protocols and more-rigorous controls 
would facilitate experimental and potentially therapeutic consistency 
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Disease Molecular defect of donor cell Cell type differentiated from iPS cells Disease phenocopied Drug or 
in differentiated cells unctional tests 
Metabolic cont. 
Glycogen storage disease la Defect in glucose-6-phosphate gene Hepatocyte-like cells (fetal) Yes ° 
(GSD1a) 
Familial hypercholesterolaemia Autosomal dominant mutation in LDLR Hepatocyte-like cells (fetal) Yes ° 
Crigler-Najjar syndrome Deletion in UGT1A1 Hepatocyte-like cells (fetal) ND ° 
Hereditary tyrosinaemia, type 1 utation in FAHD 1 Hepatocyte-like cells (fetal) ND ° 
Pompe disease Knockout of GAA Skeletal muscle cells Yes ° 
Progressive familial cholestasis ultifactorial Hepatocyte-like cells (fetal) ND ° 
Hurler syndrome (MPS IH) Genetic defect in DUA Haematopoietic cells No ° 
Cardiovascular 
LEOPARD syndrome Heterozygous mutation in PTPN1 1 Cardiomyocytes Yes ° 
Type 1 long QT syndrome Dominant mutation in KCNQ1 Cardiomyocytes Yes ° 
Type 2 long QT syndrome issense mutation in KCNH2 Cardiomyocytes Yes Yes 
Primary immunodeficiency 
SCID or leaky SCID utation in RAG1 None NA ° 
Omenn syndrome (OS) utation in RAG1 None NA ° 
Cartilage-hair hypoplasia (CHH) utation in RMRP None NA ° 
Herpes simplex encephalitis (HSE) utation in STAT1 or TLR3 Mature cell types of the central nervous system No ° 
Other category 
Duchenne muscular dystrophy Deletion in the dystrophin gene None NA ° 
Becker muscular dystrophy Unidentified mutation in dystrophin None NA ° 
Dyskeratosis congenita (DC) Deletion in DKC1 None NA ° 
Cystic fibrosis Homozygous deletion in CFTR None NA ° 
Friedreich’s ataxia (FRDA) Trinucleotide GAA repeat expansion Sensory and peripheral neurons, and Partially ° 


in FXN 


Heterogeneity in causative genes and 
mutations: mutations in RP9, RP1, 
PRPH2 or RHO 


Retinitis pigmentosa 


cardiomyocytes 


Retinal progenitors, photoreceptor precursors, Yes Yes 
retinal-pigment epithelial cells and rod 
photoreceptor cells 


Recessive dystrophic Mutation in COL7A1 Haematopoietic cells, and epidermis-like Partially Yes 
epidermolysis bullosa (RDEB) keratinocytes that differentiate into cells of all 

three germ layers in vivo 
Scleroderma Unknown None NA No 
Osteogenesis imperfecta Mutation in COL1A2 None NA No 


An extended version of this table includes references and more information about drug and functional tests (Supplementary Table 1). ABCD1, ATP-binding cassette, sub-family D, member 1; ADA, 

adenine deaminase; CFTR, cystic fibrosis transmembrane conductance regulator; COL1A2, a2-chain of type | collagen; COL7A1, al-chain of type VII collagen; DKC1, dyskerin; FAA, Fanconi’s anaemia, 
complementation group A; FAD2, Fanconi’s anaemia, complementation group D2; FAHD1, fumarylacetoacetate hydrolase; FMR1, fragile X mental retardation 1; FXN, frataxin; GAA, acid a-glucosidase; 

GBA, acid B-glucosidase; HbS, sickle haemoglobin; HPRT1, hypoxanthine phosphoribosyltransferase 1; DUA, a-L-iduronidase; JAK2, Janus kinase 2; KCNH2, potassium voltage-gated channel, subfamily H 
(eag-related), member 2; KCNQ1, potassium voltage-gated channel, KQT-like subfamily, member 1; LDLR, low-density lipoprotein receptor; LRRK2, leucine-rich repeat kinase 2; MECP2, methyl! CpG binding 
protein 2; NA, not applicable; NAGLU, a-N-acetylglucosaminidase; ND not determined; PRPH2, peripherin 2; PTPN11, protein tyrosine phosphatase, non-receptor type 11; RAG1, recombination activating 
gene 1; RHO, rhodopsin; RMRP, RNA component of mitochondrial-RNA-processing endoribonuclease; RP, retinitis pigmentosa; SCID, severe combined immunodeficiency; SMN1, survival of motor neuron 1; 
SNCA, a-synuclein; SOD1, superoxide dismutase 1; STAT1, signal transducer and activator of transcription 1; TLR3, Toll-like receptor 3; UGT1A1, UDP glucuronosyltransferase 1 family, polypeptide Al. 


between laboratories and would yield standardized cell lines that 
could be used with confidence in both basic and applied studies. 
Barring that, researchers must agree on standards of molecular 
analysis to ensure that the reprogrammed cells that most closely 
approximate the generic state of the naive genome can be identified. 
Because iPS cells are subject to the same type of culture adaptations 
that affect karyotypic integrity as human ES cells”, it is important 
to define protocols that minimize the time in culture. In addition, 
cell lines used in clinical applications will need to be evaluated 
frequently for aberrant culture-induced changes at all stages: from 
the somatic cells to the reprogrammed and differentiated cells*’. 
Understanding the genomic alterations that take place during the 
reprogramming, culture and differentiation of iPS cells will be cru- 
cial for designing experiments and ensuring that the derived cells 
are functional, pure and appropriate for use in research and therapy. 
Minimizing any aberrations is important, but as long as research- 
ers understand that aberrations will arise — and can describe and 


control their effects — even imperfect cells can be used, and prefer- 
ential differentiation can be taken advantage of whenever possible. 
Characteristics of iPS cells that were initially perceived as flaws, 
including varying differentiation propensities, might prove useful 
in clinical settings to generate cell types that have been difficult to 
obtain thus far. 

Generating more stringent markers of pluripotency and assays 
to distinguish the abilities of a given iPS cell line are key priorities. 
Building on the progress that has already been made using ES cells”, 
researchers must continue to improve the understanding of directed 
differentiation and to develop new protocols. With refined differen- 
tiation protocols, researchers will be able to investigate the patho- 
physiological basis of genetic diseases and carry out drug screening 
on affected cell types. These protocols will bring the field a step 
closer to patient-matched cells and tissues for clinical transplanta- 
tion, a long-standing ambition of the stem-cell field that might be 
its ultimate measure of success. = 
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Clonal evolution in cancer 


Mel Greaves’ & Carlo C. Maley” 


Cancers evolve by a reiterative process of clonal expansion, genetic diversification and clonal selection within the adaptive 
landscapes of tissue ecosystems. The dynamics are complex, with highly variable patterns of genetic diversity and 
resulting clonal architecture. Therapeutic intervention may destroy cancer clones and erode their habitats, but it can 
also inadvertently provide a potent selective pressure for the expansion of resistant variants. The inherently Darwinian 
character of cancer is the primary reason for this therapeutic failure, but it may also hold the key to more effective control. 


ancer is a major cause of death throughout the world and, 

despite an extraordinary amount of effort and money spent, 

the eradication or control of advanced disease has not been 
achieved'. Although we have a much greater understanding of can- 
cer biology and genetics’, translation into clinical practice needs 
to allow for the cellular complexity of the disease and its dynamic, 
evolutionary characteristics. These features provide both barriers 
to, and opportunities for, successful treatment. 

In 1976, Peter Nowell’ published a landmark perspective on cancer 
as an evolutionary process that is driven by stepwise, somatic-cell 
mutations with sequential, subclonal selection. This is a parallel to 
Darwinian natural selection, with cancer clones as the equivalent 
of asexually reproducing, unicellular quasi-species. Modern cancer 
biology and genomics have validated cancer as a complex, Darwin- 
ian, adaptive system** (Box 1 and Supplementary Information). 

Cancer-clone evolution takes place within tissue ecosystem habi- 
tats. These habitats have evolved over a billion years to optimize 
multicellular function but restrain clonal expansion of renegade 
cells. However, the resilience of multicellular and long-lived animals 
depends on the phenotypic properties that, if not tightly regulated, 
drive or sustain malignancy: that is, cellular self-renewal and stabi- 
lization of telomeres, which allow extensive proliferation, angiogen- 
esis, cell migration and invasion’®. 

The long time period usually required for cancer symptoms to 
emerge and the complexity of the resultant mutations is, in part, 
a reflection of the sequential and random searches for phenotypic 
solutions to constraints from the micro-environment. The evolu- 
tionary progression of cancer is usually stalled or aborted, as shown 
by the high frequency of clinically covert premalignant lesions” ”’. 
Cancer-suppressive mechanisms relegate most cancers to old age, 
when they have little effect on the reproductive fitness of their hosts. 

Limited resources, environment architecture and other constraints 
of the micro-environment limit the size of solid tumours at every 
stage of their progression. Even advanced malignancies can show 
Gompertzian growth" — the cancer cell doubling time (around 
1-2 days) is orders of magnitude faster than tumour doubling time 
(around 60-200 days)'” — implying that the vast majority of cancer 
cells either die before they can divide" or are kept from dividing by 
the tumour micro-environment. Thus, natural selection in tumours, 
in the same way as selection in organisms, takes place through com- 
petition for space and resources. 

Oncologists change cancer-clone dynamics by introducing a 
potent source of artificial selection in the form of drugs or radiation, 
but evolutionary principles still apply. Usually, treatment will result 


in massive cell death, which provides a selective pressure for the 
proliferation of variant cells that resist treatment (the mechanisms 
for this are discussed later). Furthermore, many cancer therapeutics 
are genotoxic; cells surviving treatment, which could then go on to 
regenerate the cancer, may have mutated further, resulting in cells 
with improved fitness and malignant potential. 

The tools of and insights from evolutionary biology and ecology 
can therefore be applied to the dynamics of cancer before and after 
treatment to explain the modest returns from cancer therapy. We 
show that cancer is an inherently evolutionary process and suggest 
alternative strategies for effective control. 


Mutational drivers and clonal dynamics 

The basic principle of a Darwinian evolutionary system is the 
purposeless genetic variation of reproductive individuals who are 
united by common descent, together with natural selection of the 
fittest variants. Cancer is a clear example of such a system. Most 
mutational processes have a bias at the DNA sequence level. The 
particular mutational spectra in a cancer cell can be a reflection of 
error-prone repair processes or associated with a genotoxic exposure 
(for example, cigarette carcinogens, ultraviolet light and chemother- 
apeutic drugs’). The patterns of genetic instability (chromosomal 
or microsatellite) in cancer cells may reflect exposure to, and the 
selective pressure exerted by, some classes of chemical carcinogens”. 
Nevertheless, for the functions encoded in genes, mutagenic pro- 
cesses are essentially blind or non-purposeful (with the exception 
of intrinsic mutagenic or recombinatorial enzymes preferentially 
targeting lymphoid immunoglobin or T-cell receptor genes’). The 
recurrent, mutation-endowed fitness traits in cancer reflect the 
potent impact clonal selection can have. 

Clones evolve through the interaction of selectively advantageous 
‘driver’ lesions, selectively neutral ‘passenger’ lesions and deleterious 
lesions (a ‘hitchhiker’ mutation in evolutionary biology is equivalent 
to a passenger mutation in cancer biology). In addition, ‘mutator’ 
lesions increase the rate of other genetic changes'*"*, and micro- 
environmental’’ changes alter the fitness effects of those lesions. 
The identification of driver lesions is supported by the independent 
observation that these lesions occur more frequently in multiple 
neoplasms than would be expected in the normal background muta- 
tion rate, that they are associated with clonal expansions'*”” and 
from the type of mutation seen (missense, nonsense, frameshift, 
splice site, phosphorylation sites and double deletions)'*”, par- 
ticularly if the gene involved has a known role in cellular processes 
relevant to oncogenesis. The evidence gained from genetic studies 
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of human tumours should be corroborated with functional tests and 
animal models. Passenger lesion status can also be ambiguous or 
context-dependent: for example, cases of monoallelic loss that only 
impact on function when the second allele of the same gene is lost, 
mutations that only cause a phenotypic effect when another gene 
locus also mutates, or cases in which the mutants are functionally rel- 
evant only in the context of therapeutic responses involving that gene. 

Only a few studies have attempted to quantify the selective advan- 
tage provided by driver mutations. Bozic et al.” (using a non-spatial 
population genetics model of sequential, exponential clonal expan- 
sion) derived a formula for the proportion of expected neutral pas- 
senger mutations versus the proportion of selectively advantageous 
driver mutations as a function of the selective advantage of the driver 
mutations. By fitting this equation to glioblastoma and pancreatic 
cancer resequencing data, the authors estimated that driver muta- 
tions gave an average fitness advantage of only 0.4% (ref. 21). To 
measure the mutant clone selective advantage directly would require 
longitudinal samples of a neoplasm and estimation of the clone sizes 
at each time point. 

The dynamics of somatic evolution depend on the interaction of 
mutation rate and clonal expansion. Mutation rate varies substan- 
tially between different genomic regions” and between different 
types of abnormality (for example, single-base sequence changes 
versus balanced chromosomal rearrangements and gene fusions), 
and mutation rates will increase by the instigation of genetic instabil- 
ity” >. The rate of epigenetic change has been estimated to be orders 
of magnitude higher than that of genetic change’, and could bea 
major determinant of clonal evolution. Natural selection affects epi- 
genetic variation within neoplasms”, because epigenetic changes are 
inherited at cell division and can affect cell phenotypes. Evolution- 
ary biology tools to address many of these mutation rate complexi- 
ties exist (see Supplementary Information), but these remain under 
used in cancer biology”. The traditional model of clonal evolution 
suggests that a series of clonal expansions grows to dominate the 
neoplasm (‘selective sweeps’)'®”'”°, but this can occur only if the 
time to the next driver mutation is longer than the time required for 
a clone to sweep through the neoplasm. In addition, if the second 
mutation occurs in a competitor clone, the expansion of both clones 
is restrained by mutual competition (known as clonal interference)”. 
Given the large population size and high mutation rate typical of 
neoplasms, clonal competition is probably common’*””. This issue 
is best addressed by serial sampling, and limited data suggest that 
parallel clonal expansions occur before subclones begin to domi- 
nate in early cancer development**’. Initial evidence indicates that 
large clonal expansions after cell transformation are rare”’. Direct 
evidence, from serial sampling of oncogenic mutations in advanced 
disease*’, metastasis’ or post-chemotherapy relapses (see Supple- 
mentary Information), indicates selective sweeps originate from 
pre-existing genetic variants or subclones. 


Punctuated equilibrium versus gradualism 

The argument of gradualism versus punctuated equilibrium” (a 
longstanding debate in species evolution) has recently emerged in 
the consideration of the clonal evolution of neoplasms. It is unknown 
whether malignant clones, with their markedly altered genomes, 
evolve gradually through a sequence of genetic alterations and 
clonal expansions; accumulate many lesions over time in a rare, 
undetected subclone that finally appears in a clonal expansion; or 
have a few, large-scale punctuated changes, possibly prompted by 
an acute insult or a single, catastrophic mitotic event that generates 
multiple lesions across the genome (or on a single chromosome, 
known as chromothripsis)*”. Evidence of tens of non-synonymous 
mutations in cancers was interpreted under the assumption that 
they were generated by tens of clonal expansions”. Reconstruction 
of genealogies of neoplastic clones, based on genetic heterogene- 
ity within neoplasms, suggests that clones with ancestral genomes 
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BOX1 
Cancer as acomplex 
system 


. Cancers exist in a variety of taxonomic quasi-classes, genera, 
species, characterized by divergent cells of origin and 
mutational spectra. Each cancer is unique. 

. Cancers evolve over a variable time frame (anywhere from 1 to 
50 years), and the clonal structure, genotype and phenotype 
can shift over time in each patient. Each cancer is, in effect, 
multiple different (subclonal) cancers that occupy overlapping 
or distinct tissue habitats. 

. The number of mutations in a cancer can vary from a handful 
(10-20) to (the more usual) hundreds or thousands. The great 
majority are passengers, and a modest, but undefined, number 
are functionally relevant drivers. The mutational processes are 
very diverse. 

. Cancers acquire, through mutational and epigenetic changes, 
a variety of phenotypic traits that compound to allow territorial 
expansion, by proliferative self-renewal, migration and invasion 
— properties that are cruical to normal developmental, 
physiological and repair processes. 

. Advanced, disseminated or very malignant cancers seem to be 
almost uniquely competent to evade therapy. 

. Most, if not all, of this complexity can be explained by classical 
evolutionary principles. 


are not driven to extinction by later clonal expansions’, which 
allows the history of a neoplasm to be revealed. Breast cancer data™ 
have shown that clones with intermediate genotypes are difficult to 
detect; each clone generates a cloud of genetic neutral or non-viable 
subclone variants around it. A study of B-cell chronic lymphocytic 
leukaemia” suggests that intermediate clones can be detected, but 
at a frequency of <0.001, which was below the detection threshold 
of the breast cancer study”. Intermediate clones may be rare because 
they have had limited potential to expand or because they were once 
common but were outcompeted by more recent clones. 

The frequency of premalignant clonal lesions (or carcinoma in 
situ) substantially exceeds clinical cancer rates’ °. This, as well as 
cancer dormancy” and genetic reconstitution of clonal histories”, 
indicates that cancer clones have long periods of stasis. However, 
cancer-clone evolution probably passes a point of no return, possibly 
at the metastatic growth stage. If unlimited proliferative capacity 
is guaranteed by telomere stabilization”, then clonal expansion is 
stopped only when the size threatens the life of the patient. When 
provided (albeit rarely) with the routes for dissemination and immu- 
noselection, cancer cells can have a parasite-like immortality and can 
re-establish themselves in other individuals®*”””. 


The cancer ecosystem 

Tissue ecosystems provide the venue and determinants for fitness 
selection (the adaptive landscape“). Tissue micro-environments are 
complex, dynamic states with multiple components that can influence 
cancer-clone evolution (Fig. 1). For example, transforming growth 
factor-f is a cancer-ecosystem regulatory molecule”. Other cellular and 
cytokine components of inflammatory lesions are potent and common 
modulators of the cancer-cell ecosystem”. 

The interaction between cancer cells and their tissue habitats is 
reciprocal. Cancer cells can remodel tissue micro-environments and 
specialized niches to their competitive advantage**. Cancer-clone 
expansion is controlled by architectural constraints or barriers, such 
as sequestration of stem cells into crypts in the gastrointestinal tract” 
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Figure 1 | The complexity of tissue 
ecosystems. Exposure, the constitutive 
genetics of the host cells, systemic 
regulators, local regulators and architectural 
constraints all impinge on the evolution of 
somatic cells. 
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and the need for external signals for proliferation and cell survival. 
However, some micro-environmental components can promote 
neoplastic cells; for example, infiltrating macrophages and neovas- 
cularization, in response to anoxia, can support neoplastic cell sur- 
vival and proliferation. Mathematical modelling shows cancer-clone 
evolutionary selection for more robust or malignant phenotypes is 
less likely in more stable or homogeneous micro-environments™. 
Spatial heterogeneity of resources in the primary tumour selects 
for cell migration and emigration, which may explain why there is 
selection for metastasis”. Preclinical models have suggested that 
normalizing the resources across the primary tumour can suppress 
metastasis’’. As clones and subclones expand, migrant cells invade 
new habitats within and between tissues, in which they experience 
new selective pressures that can cause further cancer-cell diversity. 
This malignant feature, and its associated morbidity, characterizes 
end-stage cancer. 

Cancer-cell habitats are not closed systems. The tissue ecosystem, 
in addition to regulation by systemic factors (such as nutrients and 
hormones) or invasion by inflammatory or endothelial cells, is 
modified by external factors. As well as the tissue site, the ecosys- 
tem for each cancer includes environmental, lifestyle and associated 
aetiological exposure of the patient. Genotoxic exposure (such as, 
cigarette carcinogens or ultraviolet light), infection, and long-term 
dietary and exercise habits that affect calorie, hormone or inflam- 
mation levels can have a profound effect on the tissue micro-envi- 
ronments, as well as directly on cancer cells (Fig. 1). These factors 
are the aetiological link to the initiation or progression of cancer, 
and without such modulating exposure, the risk of cancer-clone 
initiation and evolution would be reduced. 

Cancer-tissue ecosystems can be radically altered after 
chemotherapy or radiotherapy. Most cancer cells may be deci- 
mated, but the remodelled landscape creates new selective pressures, 
resources and opportunities that may allow pre-existing variant can- 
cer cells that survived treatment to emerge. Crucially, stroma or spe- 
cialized habitat niches may protect cancer cells against the therapy”. 


Cancer genomics and clonal architecture 

Cancer-genome sequencing, facilitated by the introduction of 
second-generation whole-genome sequencing, has provided further 
insight into the complexity of the genetics and evolutionary biology 
of cancer cells”. In most cases, transformation and metastases are 
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probably clonal’ because they are derived from single cells; there- 
fore, the identification of the mutations present in all of the cells 
of a tumour can help to reconstruct the genotype of the founder 
cell. These founder events limit the genetic and clonal complexity 
of tumours. We already had a long list of recurring driver mutations 
(with gain or loss of function) as a result of the fine mapping of chro- 
mosomal breaks, candidate gene sequencing and functional screen- 
ing of bulk samples from tumours. However, the use of genomic 
screens has demonstrated the scale of cancer-genome complexity. 
Individual cancers can contain hundreds, or tens of thousands, of 
mutations and chromosomal alterations’. The great majority of these 
are assumed to be neutral mutations arising from genetic instability. 
Chromosomal instability (amplifications, deletions, translocations 
and other structural changes) is a common feature, but it is not clear 
whether there is an increased rate of simple base-pair mutations 
in cancer*”’****, Evolutionarily neutral alterations are thought to 
register in the screens because they hitchhike on clonal expansions 
that are driven by selectively advantageous alterations or by drift. 
In addition, data have confirmed that each cancer in each patient 
has an individually unique genomic profile. It is possible that cancer 
cells need only a modest number of phenotypic traits to deal with 
all of the constraints and evolve into a fully malignant or metastatic 
tumour”, but the genomics data suggest that this can be achieved 
by an almost infinite variety of evolutionary trajectories and with 
multiple different combinations of driver mutations”. 
Paradoxically, genome profiles underestimate complexity. So far, they 
have been mostly one-off snapshots from a single sample at a single diag- 
nostic time point. We know that serial or parallel sampling using more 
conventional genetic analysis uncovers genetic diversity within a tumour. 
Whole-genome sequencing of paired primary tumours versus metastatic 
samples has so far been limited, but it has revealed that individual meta- 
static lesions are clonal in origin and genetically unique, yet have a clonal 
ancestry traceable to the primary tumour’. ‘The genome description is 
perhaps also misleading because genetic variants are identified in 5-50% 
of reads, which suggests subclonal distribution of most mutations”, but 
the segregation pattern of mutations within subclones is lost when DNA 
is extracted from the total cell population. This is important if patient- 
specific genomic profiles are to provide a platform for selecting thera- 
peutic targets. Arguably, subclonal genetic diversity is key to the success 
or failure of therapy. This is a considerable challenge, technically and 
bioinformatically, in cancer genomics and will require deep sequencing” 
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Figure 2 | The branching architecture of evolution. a, Cancer clones. 
Selective pressures allow some mutant subclones to expand while others 
become extinct or remain dormant. Vertical lines represents restraints 
or selective pressures. This is a representative pattern for common, solid 
cancers; as recognized by Nowell’, leukaemic clones may expand over 

a shorter time frame (years versus decades), and be subject to fewer 
restraints and mutational events. Ecosystems 1-4 (boxes) represent the 


and investigation of the genomes of single cells for patterns of segregation 
of mutations to understand the genetic diversity within neoplasms and 
how this changes in response to interventions. 


Subclonal segregation of mutations and clonal architecture 

The classic model of clonal evolution suggests there is a sequential 
acquisition of mutations with concomitant, successive subclonal 
dominance or selective sweeps. Histopathological evidence of dis- 
ease progression (adenoma, carcinoma and metastases) supports this 
model. At each stage of this evolution, individual cells and their progeny 
(subclones) compete for space and resources. Multiplexed, single-cell 
mutational analysis (ideally in serial samples) is the most appropriate 
way to examine clonal architecture. So far, there are only a few exam- 
ples of this'’**”’, but they have provided evidence of the complex pat- 
tern of subclonal segregation of mutations — consistent with Nowell’s 
model. The large amount of data from tissue sections, small biopsies 
and, more recently, single-cell analysis” is evidence that the evolution- 
ary trajectories are complex and branching, exactly as Nowell proposed 
and in parallel with Darwin's iconic evolutionary speciation tree (Fig. 
2). Attempts to simplify this complex system into a linear sequence of 
mutational events on the basis of cross-sectional data have probably 
been misleading™. However, by comparing the mutational genomes of 
the subclones, it is possible to discover their evolutionary or ancestral 
relationships, as well as the order of events during the development 
of that neoplasm*”***”°***, Clonal evolution from common ancestral 
cancer cells is demonstrated in identical twins with concordant acute 
leukaemia®”, in metastatic lesions” and, by inference, in some cases 
of bilateral testicular cancer” (Fig. 3). In this context, divergent cancer- 
clone genotypes and phenotypes correspond to allopatric speciation in 
separate natural habitats (for example, Darwin’s finches on the Galapa- 
gos Islands”). 

Profiles of subclones within a neoplasm can be used to determine 
‘molecular clocks’ that can then be linked to time events in the history 
of the neoplasm. For example, DNA methylation changes and base- 
pair mutations have been used to infer clonal expansion dynamics” 
and the time between initiation, invasion and metastasis!”*”"”. It is 
even possible to determine the relative timing of events during pro- 
gression from a single sample, based on deep sequencing”. 

Subclones may be mixed together within the primary tissue 
but given their single-cell origin and bifurcating pathways, it is not 
surprising that they can also occupy distinctive territories"? 
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different tissue ecosystems or habitats. Smaller boxes within Ecosystem 
1 represent localized habitats or niches. Each differently coloured circle 
represents a genetically distinct subclone. Metastatic subclones can 
branch off into different time points in the sequence from either minor 
or major clones in the primary tumour. Tx, therapy. CIS, carcinoma in 
situ. b, Darwin's branching evolutionary tree of speciation from his 1837 
notebook. 


(Fig. 4a). Cancer-clone evolution involves contemporaneous sub- 
clones with distinctive mutational and phenotypic profiles that may 
be territorially segregated, which has considerable practical implica- 
tions for diagnosis, prognosis and targeted therapy based on biopsy 
sampling®. It remains unclear whether all subclonal diversification 
reflects the impact of driver mutations and selective advantage, or 
is also the result of genetic drift of selectively neutral mutations or 
even epigenetic alterations. The level of diversity within the sub- 
clonal structure can be measured***** and has been shown to be a 
robust biomarker for predicting progression to malignancy in Bar- 
rett’s oesophagus”. It is also associated with the tumour stage and 
subtype of breast cancer™. 


Units of selection and cancer stem cells 
Evolutionary theory suggests that natural selection operates in any 
system that has components with varying reproductive potential*. In 
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Figure 3 | Divergent (branching) clonal evolution of cancer with 
topographical separation. In each example, a clonal (single cell) ancestry is 
indicated by a shared acquired mutation (for example, ETV6-RUNX1 fusion 
for leukaemias and KIT mutation for testicular cancers). The time at which the 
two subclones evolve (T, and T,) can be temporarily synchronous or develop 
several years apart’”**’. The probabilities of subclones emerging as shown 
are independent and different (p, and p,). In most cases (90% for monozygotic 
twins), only one twin develops overt leukaemia. The penetrance of bilateral 
testicular cancer having a common origin” is unknown. 
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the progression of cancer, or its resurgence after therapy, the primary 
unit of selection is the cell. This cell has to have extensive replicative 
potential, the so-called cancer stem cell (also known as the cancer- 
initiating or propagating cell) (Fig. 5). 

The cancer stem-cell hypothesis was developed through 
transplantation experiments with leukaemic cells”, and although it 
has been reported to be a general feature of all cancers”, this idea is 
contentious. There has been no consensus on whether cancer stem 
cells are rare or high-frequency cells, or whether they have fixed, 
hierarchical or variable phenotypes, but considering the evolution- 
ary progression in cancer, cells with extensive propagating activity 
are unlikely to be fixed entities. Cancer stem cells are the cellular 
drivers of subclonal expansion and so probably vary in frequency 
and phenotypic features. The only feature they must have is the 
potential for extensive self-renewal (Fig. 5). Quantitative measures 
of stem-cell activity or self-renewal (through xenotransplantation 
or gene-expression signatures) can be used to predict the clinical 
outcome of several cancer types”. The cancer stem cell’s ability to 
self-renew is made stronger by an aberrant genotype and, possi- 
bly, other, epigenetic, features. Several testable predictions can be 
made from this. First, cancer stem cells should evolve and change 
in genotype and phenotype as the cancer evolves before and after 
therapy. Some therapies may even provide a strong selection for 
cancer stem-cell survival and proliferation”’. Second, as cancers 
progress, there should be selective pressure for the cells with the 
most extensive self-renewing capacity, but at the expense of cells 
with the ability to differentiate. This has been observed in chronic 
myeloid leukaemia (CML)” and mouse models”. A higher prob- 
ability of symmetrical self-renewing proliferative cycles would be 
expected to result in an increased number and frequency of cancer 
stem cells. It is therefore of some consequence that loss of the TP53 
DNA damage checkpoint, which frequently correlates with cancer 
progression and clinical intransigence’’, seems to ‘release’ stem-cell- 
like transcriptional signatures”° and leads to enhanced self-renewal 
in mammosphere culture systems”. The frequency of cancer stem 
cells could then increase from low to very high frequency as the 
disease progresses’”*”. Third, for selection to operate through micro- 
environmental or therapeutic pressures, there should be contempo- 
raneous genetic variation in cancer stem cells, which has been shown 
in leukaemias®””. 

These considerations have significant clinical implications. 
Whatever the frequency and phenotype, if self-renewing cancer 
stem cells drive and sustain cancer-clone evolution, this suggests 
they are the repository of functionally relevant mutational events 
that drive clonal selection before and after therapy. This supports the 
view that cancer stem-cell restraint or elimination should be the aim 
of any therapy. However, if cancer stem cells are as genetically (and 
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epigenetically) diverse as evolutionary considerations and initial 
experiments”*”* indicate, this could be the reason for therapeutic 
failure. The adaptability of cancer stem cells provided by genetic 
diversity is added to by what seems to be their intrinsically lowered 
susceptibility to drugs and irradiation”. This may be because of 
the association with stromal cells** and the quiescence of cancer 
stem-cell subpopulations, as well as the properties of enhanced DNA 
repair and elevated expression of drug efflux pumps, which may be 
the evolved contingencies to protect normal stem cells. 

Subclonal genetic heterogeneity is a common, if not universal, 
feature of cancers*’. However, it cannot be assumed that all subclones 
are sustained by cancer stem cells; some could be evolutionary dea- 
dends generated by cells with only limited propagating potential. It 
is partly to accommodate this that the in vivo assay for cancer stem 
cells involves sequential transplants”. Ideally, the genomes of single 
cancer stem cells would be interrogated to investigate how they relate 
to subclones, but this is not currently possible. However, the genetic 
heterogeneity of cancer stem cells can be inferred by comparing 
subclonal diversity or clonal architecture before and after trans- 
plantation. Quadrant sections of glioblastoma have been shown to 
have divergent but related genotypes, but all sections contained cells 
that read-out in the in vivo (intracerebral) cancer stem-cell assay”. 
More definitive data come from comparing pre- and post-transplant 
subclonal genetic profiles that were investigated at the single cell 
level or by single nucleotide polymorphism arrays in B-cell precur- 
sor acute lymphoblastic leukaemia. Multiple subclones from each 
patient’s diagnostic sample registered in the in vivo cancer stem-cell 
transplant assays, albeit with variable competitive potency*****’. We 
are still awaiting experimental confirmation that genetic diversity of 
cancer stem cells is a common feature of cancer, but, assuming that 
it is, this will have important therapeutic implications. 


A Darwinian bypass 

Nowell’ stated in his landmark article “more research should be 
directed towards understanding and controlling the evolutionary 
process in tumours before it reaches the late stage seen in clinical 
cancer”. Although cancer therapy has had its successes, in reality 
very few advanced or metastatic malignancies can be effectively con- 
trolled or eradicated. Genetic variation in cancer stem cells, particu- 
larly if induced by genetic instability, provides the opportunity for 
cells to escape and the therapy to fail. Other, non-genetic, mecha- 
nisms of positive selection by therapy exist, including signalling plas- 
ticity (or oncogene bypass)**, quiescence” and epigenetic changes™; 
however, many of these depend on heritable, and thus selectable, 
epigenetic variation. Great expectation has been placed on the audit 
of cancer genomes that, by identifying recurrent and “druggable” 
mutations, would herald a new phase of highly specific or targeted 


© 2012 Macmillan Publishers Limited. All rights reserved 


small-molecule inhibitors and personalized medicine”. Oncogene 
addiction may be the Achilles heel of cancer in this respect”. The 
success of imatinib and the derivative non-receptor tyrosine (ABL1) 
kinase inhibitors in CML” was very encouraging, but CML is not 
a typical cancer. It is essentially a premalignant (albeit ultimately 
lethal) condition, probably driven by a single founder mutation 
(BCR-ABLI fusion), which provides a universal target for therapy. 
Even in the most favourable of circumstances, escape occurs either 
by quiescence (and coupled resistance) of cancer stem cells” or by 
mutation of the ABL1 kinase target. Once CML has evolved to an 
overt malignancy or blast crisis, with increased genetic complexity, 
ABL1 kinase-directed therapy is often ineffective. 

Other small-molecule inhibitors directed at mutant products have 
produced encouraging results in patients with advanced disease, but 
the benefits are transitory and cancer clones re-emerge with resistant 
features. When the targets selected are non-founder mutations, even 
if they are dominant in the neoplasm, therapy can be predicted to 
select for subclones lacking the mutant target”. Alternatively, sub- 
clones can have additional mutations that allow a bypass of the sig- 
nalling pathway of the drug target, such as the MET proto-oncogene 
(MET) amplification in EGFR mutant lung cancer treated with EGFR 
kinase inhibitors”. 

Supporters of targeted therapy and personalized medicine argue 
that a combination of drugs that target components of networked 
signalling and are tailored to the individual patient’s cancer genome 
is the solution to this problem. In this regard, synthetic lethal strate- 
gies seem promising”. 

Self-renewing cancer cells are the ultimate target for therapy, so 
high-throughput screening for selective inhibitors is an encouraging 
development”’. Ways to target the components of the self-renewing 
process itself (independent of specific mutant genotype) deserve 
exploration, especially if a distinction can be made from normal 
adult stem cells. In the case of CML, intrinsically resistant (and pos- 
sibly quiescent) stem cells, have been targeted by combining selective 
kinase (ABL1) inhibitors with inhibitors of a histone deacetylase” or 
BCL6 (ref. 95). Ultimately, it may prove difficult to thwart the plastic- 
ity and adaptability of cancer cells (or cancer stem cells), which are 
an inherent evolutionary feature of advanced disease, and a ‘Dar- 
winian bypass’ may be required, for which there are a number of 
possibilities. An implication of the evolutionary diversity of cancer 
is that prevention (smoking cessation, avoiding sunburn, prophy- 
lactic vaccines, and so on) makes a great deal of sense, as does early 
detection and intervention (that is, before genetic diversification 
and dissemination become extensive). 

An alternative therapeutic strategy is to focus on the micro- 
environmental habitat using ‘ecological therapy’, which aims to 
change the essential habitat and dependency of the cancer cells”*. 
For example, anti-angiogenesis can provide a potent restraint on 
cancer stem cells’’. Other examples are the use of bisphosphonates to 
remodel bone in patients with prostate cancer, the use of aromatase 
inhibitors in patients with breast cancer, exploiting hypoxia, the use 
of inhibitors of inflammation or tumour-infiltrating macrophages, 
and blocking cancer stem-cell interactions with essential stromal or 
niche components”. 

Another alternative is to control the cancer, rather than eradicate 
it, thereby turning cancer into a chronic disease. Because the speed 
of evolution is proportional to the fitness differential between 
the cells, cytotoxic drugs are predicted to select rapidly for resist- 
ance’. It is thought they cause competitive release” by removing all 
of the competitors of resistant cells. In contrast, cytostatic drugs 
should delay progression and mortality longer than cytotoxic drugs 
because sensitive competitor cells remain in the tissue to occupy 
space and consume resources that would otherwise be used by the 
resistant clones. In addition, by suppressing cell division, cytostatic 
drugs also suppress the opportunities for new mutations. A study 
by Gatenby and colleagues'” showed that by treating an aggressive 
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Figure 5 | Selective pressure on cancer stem cells. Selective pressures 

can include environmentally derived genotoxicity, natural or physiological 
restraints, cancer therapy, and so on. Mutation in progenitor cells can 
convert these cells back to a self-renewing population”. The small blue arrow 
represents a mutation; the large blue arrow represents differentiation: in both 
cases they represent a change in state. In addition to the mandatory trait of 
self-renewal, cancer stem cells (CSC), can exhibit any phenotypic feature that 
allows cells to continue to survive and proliferate in the face of a particular 
constraint. D*, differentiation; SR, self-renewal. 


ovarian cancer (OVCAR-3) xenograft tumour to maintain a sta- 
ble size, rather than to eradicate it, host mice could be kept alive 
much longer. Moreover, the dose of carboplatin necessary to keep 
the tumour at a manageable size declined over time’”’. Researchers 
should now focus on what phenotypes can be selected for to make 
neoplasms less deadly and more clinically manageable. 

The evolutionary theory of cancer has survived 35 years of empirical 
observation and testing, so today it could be considered a bona fide 
scientific theory. The basic components of somatic evolution are well 
understood, but the dynamics of somatic evolution remain unclear. 
Fortunately, there are evolutionary biology tools that may be applied to 
neoplasms to address many of the fundamental cancer biology ques- 
tions, such as the order of events in progression, distinguishing driver 
from passenger mutations, and understanding and preventing thera- 
peutic resistance. The dynamics of clonal diversification and selection 
are critical to understanding these issues. The challenge nowis to use 
the clinical opportunities to address directly the evolutionary adapt- 
ability of neoplasms and design interventions to slow, direct or control 
cancer-cell evolution to delay or prevent mortality. m 
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The contribution of bone to 
whole-organism physiology 


Gérard Karsenty’ & Mathieu Ferron' 


The mouse genetic revolution has shown repeatedly that most organs have more functions than expected. This has led 
to the realization that, in addition to a molecular and cellular approach, there is a need for a whole-organism study of 
physiology. The skeleton is an example of how a whole-organism approach to physiology can broaden the functions of a 
given organ, reveal connections of this organ with others such as the brain, pancreas and gut, and shed new light on the 
pathogenesis of degenerative diseases affecting multiple organs. 


here are two definitions of physiology. In its most modern 

incarnation, physiology focuses on signalling and gene expression 

events occurring within cells as they relate to a given function and 
is often referred to as molecular and cellular physiology. An older physi- 
ology deals with events occurring and molecules acting outside the cell. It 
can be called whole-organism physiology and aims at identifying interac- 
tions taking place between organs through what Claude Bernard called 
the “milieu intérieur”’. Its study relies on three basic principles. First, the 
premise of whole-organism physiology is that no function is determined 
by one organ alone; this hypothesis has been verified multiple times 
through the use of model organisms” *. Second, homeostasis, a princi- 
ple positing that different organs exert opposite influences on the same 
function to regulate it tightly, applies to most physiological functions’. 
Closely related to homeostasis is the cardinal rule of endocrinology, that 
is feedback regulation: a regulated organ talks back to a regulating one 
to limit its influence’. Third is the principle in which whole-organism 
physiology resembles any other aspect of biology: regulatory molecules 
appear during evolution with the functions they regulate, not millions 
of years afterwards. 

In vertebrates the modern study of whole-organism physiology uses 
two main tools. The experimental one, which favoured the renaissance 
of this physiology, is mouse genetics. This has allowed us to decipher, 
one gene at a time, how organs influence each other. For the focus of this 
Review, namely skeleton physiology, mouse genetics has proved to be an 
extremely reliable tool, possibly because bone is one of the last tissues 
to appear during evolution, and consequently most genes involved in 
skeletal biology have conserved functions from mice to humans. The 
second tool is the observation of internal medicine: mirror images of 
many physiological processes may be found in disease symptoms or drug 
side effects. Both tools are equally useful in extending the range of func- 
tions of the skeleton and our understanding of their molecular bases. 

In this Review, we show how a whole-organism approach to physi- 
ology has modified our view of the skeleton. To that end, we describe 
work pointing towards a coordinated regulation of bone mass, energy 
metabolism and fertility; a central control of bone mass; and the roles of 
leptin, serotonin, insulin and osteocalcin in these pathways. 


The skeleton and whole-organism physiology 

A strategy that could be used to determine whether the skeleton 
is influenced by organs that are not classically associated with it 
and whether it influences other organs is to confront features of 
bone that are unique to clinical and experimental observations. This 


approach may indicate physiological functions that could affect or 
be influenced by bone. The reality of such interactions could then 
be tested genetically. 

A striking feature of bone is that it is the only tissue that contains a 
cell type, the osteoclast, whose unique function is to resorb (destroy) 
the host tissue’. This does not occur at random but rather in the con- 
text of a true homeostatic function, which is called bone modelling 
during childhood and remodelling during adulthood. This func- 
tion, hereafter referred to as bone (re)modelling, is characterized by 
alternating phases of destruction by osteoclasts and bone formation 
by osteoblasts*. Bone modelling allows longitudinal growth, without 
which most vertebrates could not ambulate and, therefore, could 
not live. It is, by definition, a survival function. Bone (re)modelling 
occurs daily in multiple locations in an organ covering a very large 
surface area. Both the cellular events it entails and the surface area 
of the organ in which they occur suggest there is a high energetic 
cost. Clinical observations add credibility to this view of bone (re) 
modelling as an energy-demanding process. Specifically, the absence 
of food — that is, energy — intake, as in patients with anorexia ner- 
vosa, causes a near-total arrest of growth in children and low bone 
mass in adults”'®. Moreover, in a manner unrelated to food intake, 
gonadal failure leads to low bone mass in both sexes, thereby sug- 
gesting a link between bone mass accrual and fertility'’’. Although 
compelling, such clinical evidence remained correlative. It became 
an incentive for laboratory investigation because of an experimental 
observation that was striking because it was unexpected. Osteocalcin 
was, at the time the encoding gene was inactivated in the mouse, 
the only osteoblast-specific secreted protein”. This by itself justi- 
fied the study of its function in vivo with the hope of learning more 
about bone biology. Surprisingly, osteocalcin deletion resulted in 
mice that were abnormally obese and that bred poorly'*”’. These 
phenotypes suggested that, in ways that still need to be explained, 
bone was affecting fat accumulation and possibly other aspects of 
energy metabolism and reproduction. 

Taken together, this view of bone (re)modelling, clinical 
observations and the phenotypes of mice deficient in the osteocalcin 
gene suggested that there might have been a coordinated regula- 
tion of bone mass or growth, energy metabolism and reproduction. 
Because this hypothesis was triggered by the energy cost of bone 
(re)modelling, one would expect that the hormones orchestrating 
it would appear with the skeleton during evolution. Several labora- 
tories have tested this hypothesis in the past 10 years. 
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A whole-organism physiology view of leptin 

For a bone biologist in the late 1990s, it seemed easier to explore 
the aforementioned assumption by studying a hormone for which 
the biology has been characterized and the receptor identified. This 
hormone was leptin, an adipocyte-specific molecule identified for its 
ability to limit appetite and to favour energy expenditure and repro- 
duction, two tenets of the overarching hypothesis'*'*. Remarkably, 
and this is important in view of its function described here, leptin 
does not appear during evolution with any aspect of energy metabo- 
lism or reproduction, but is instead associated with bone (re)mod- 
elling. This would be mere coincidence if ob/ob (obese) or db/db 
(diabetic) mice, which lack leptin or its receptor, respectively, did not 
demonstrate a high bone mass because of a massive increase in bone 
formation”. This is a biological tour de force because these mice 
have no functional gonads, a situation that would otherwise increase 
bone resorption and decrease bone mass. Subsequently, leptin regu- 
lation of bone mass was verified in sheep and humans”. The high 
bone mass (despite hypogonadism) observed in the absence of leptin 
signalling and the fact that leptin is a vertebrate invention suggested 
that bone was a major target of this hormone; a model of partial 
gain-of-function of leptin signalling allowed this hypothesis to be 
tested. To mediate its functions, leptin binds to a receptor (Lepr) 
linked to the tyrosine kinase Jak2. Leptin binding activates Jak2, 
resulting in the phosphorylation of several residues on Lepr. One 
of these, Tyr 985, binds Socs3, an event that attenuates signalling 
through Lepr’ (Fig. 1). Accordingly, mutation of this residue in Lepr 
(Y985L) in //] mice, which are homozygous for this substitution, 
results in a partial gain of function of leptin signalling”. Because 
the increase in signalling is only partial, phenotypes displayed by 1/1 
mice reveal one or more functions of leptin that require its lowest 
threshold of signalling’. Hence, I/] mice breed normally and have 
normal appetite when fed on normal chow, but are osteoporotic; 
these observations suggest that the threshold of leptin signalling 
necessary to affect bone mass is lower than that needed to affect 
appetite and reproduction”. 

Several genetic pieces of evidence indicate that leptin acts centrally 
to inhibit the accrual of bone mass. The most convincing argument, 
although not the only one, is that a neuron-specific deletion of Lepr 
recapitulates the bone phenotype of ob/ob mice, whereas an osteo- 
blast-specific one does not”. This is consistent with what has been 
shown for other functions of leptin that also occur through a cen- 
tral relay. In broader terms this revealed for the first time a central 
control of bone mass; its existence has now been verified by other 
laboratories studying how neuropeptide Y or neuromedin U regulate 
bone mass”*”’. Over the years it has been proposed that, in contrast 
to most of its functions, leptin could also regulate bone mass through 
local means. This hypothesis is based on studies that injected large 
amounts (micrograms per day) of leptin into wild-type mice” or 
infused leptin into the third ventricle of ob/ob mice”. That leptin did 
not decrease the percentage of fat in the latter study, in contrast to 
expectations, raises questions about the efficacy of these infusions. 

Let us concentrate here on the central mode of action of leptin. We 
needed to know where it signals in the brain to fulfil this function and 
the regulation of energy metabolism (we viewed these two functions 
as co-regulated). Initially, it seemed that the hypothalamus was where 
all the action was taking place. The leptin receptor is highly expressed 
in ventromedial (VMH) and arcuate neurons of the hypothalamus, 
and chemical lesioning of these neurons had demonstrated their 
involvement in the regulation of appetite and bone mass; moreover, 
leptin infusions in the third ventricle of ob/ob mice decreased bone 
mass and appetite only if these hypothalamic neurons were intact”. 
This simple view was shattered by landmark studies showing that 
the selective inactivation of Lepr in VMH or arcuate neurons did 
not affect appetite or the accrual of bone mass in mice fed on a nor- 
mal diet, the diet on which ob/ob mice display hyperphagia*’*. One 
interpretation of these seemingly contradictory results is that they are 
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Figure 1 | Leptin co-regulates appetite and bone mass. Leptin signals 
through its receptor (Lepr), expressed in serotonin-producing neurons of 
the raphe nuclei in the brainstem, decreasing the expression of tryptophan 
hydrxylase 2 (Tph2), the gene encoding the initial enzyme for serotonin 
biosynthesis. Signalling by Lepr requires Jak2 and is negatively regulated 

by Socs3. Serotonin-producing neurons of the raphe nuclei project to the 
ventromedial (VMH) and the arcuate (Arc) hypothalamic nuclei. Serotonin 
signalling in VMH neurons decreases the activity of the sympathetic nervous 
system (SNS), an inhibitor of bone mass accrual. In Arc neurons, serotonin 
signalling has anti-anorexigenic effects. 


instead complementary in suggesting that leptin requires the integrity 
of hypothalamic neurons to regulate appetite and bone mass, but it 
need not bind to them. In other words, leptin might signal elsewhere 
in the brain to regulate the synthesis of neurotransmitters that will 
then act in the hypothalamus. Admittedly, this is a shift in the view 
of leptin signalling in the brain, but a novel approach was needed 
because cell-specific deletion of Lepr in hypothalamic neurons did 
not reproduce phenotypes seen in ob/ob mice fed on normal chow. 
The fact that patients chronically treated with serotonin reuptake 
inhibitors can develop low bone mass suggested that serotonin and 
leptin signalling intersect in the brain*”*. 

In some instances cell-specific gene inactivation can be achieved 
without using sophisticated techniques. These are favourable circum- 
stances because cell-specific gene deletion techniques are not without 
risks. This situation presented itself for leptin and for brain-derived sero- 
tonin, which is made only in neurons of the raphe nuclei and does not 
cross the blood-brain barrier”. It could therefore be eliminated from the 
brain by conventional inactivation of tryptophan hydroxylase 2 (Tph2), 
the rate-limiting enzyme of serotonin synthesis®*. The fact that Tph2*- 
mice were distinctly osteoporotic and anorectic established that brain- 
derived serotonin favours bone mass accrual and appetite and, because 
it does not cross the blood-brain barrier, identified serotonin as the first 
neurotransmitter regulating bone mass accrual’. Axonal connections 
between serotonin-producing neurons and hypothalamic neurons can 
exist and are probably functionally important, because ablations of 
serotonin receptors in specific neurons of the hypothalamus result in 
osteoporosis or anorexia in the same way as Tph2 deletion does”. 
This work, suggesting a direct brainstem—hypothalamus axis, does not 
contradict or exclude a more recent study proposing that serotonin 
signalling in the hypothalamus occurs through an interneuron”. This 
interneuron will need to be identified for the proper evaluation of the 
respective importance of these two modes of action. 
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Leptin inhibits Tph2 expression in, and serotonin release from, 
brainstem neurons; consequently, serotonin content in ob/ob hypo- 
thalami is abnormally high. Removing one allele of Tph2 from ob/ 
ob mice sufficed to normalize their brain serotonin content, with 
remarkable consequences”. Indeed, ob/ ob;Tph2"' ~ mice, despite hav- 
ing no leptin, had normal appetite, energy expenditure, body weight 
and bone mass. Subsequently, the molecular bases of this leptin—sero- 
tonin-hypothalamus axis were deciphered. They include calmodulin 
signalling in VMH neurons activating the transcription factor cycli- 
cAMP-response-element-binding protein (CREB), which induces 
the expression of genes involved in catecholamine synthesis”. This 
is important because the mediator acting as a bridge between leptin 
signalling in the brain and in bone cells is the sympathetic nerv- 
ous system acting on osteoblasts through the B, adrenergic recep- 
tor (Adrb2)*°” (Fig. 1) and because B-blockers can limit the risk of 
osteoporotic fractures”. 

The fact that leptin inhibits both bone mass accrual and appetite 
is consistent with the hypothesis that bone acquisition, an energy- 
expensive process, must be linked to energy (food) intake, otherwise 
the risk of organ failure anywhere else in the body, at the time of a 
growth spurt for instance, could be high. This broader view of leptin 
proposes a simple explanation of why this hormone appeared during 
evolution with bone and when food was scarce. 


Osteocalcin and the endocrine nature of bone 

Implicit in the hypothesis that there is coordinated regulation of bone 
mass, energy metabolism and fertility is the notion that bone is not only 
a recipient of hormonal inputs but also an endocrine organ affecting the 
other two functions. It was already known that, through fibroblast growth 
factor 23 (FGF 23), bone acts as an endocrine organ, but the function 
regulated by FGF23, namely mineral metabolism, is intimately linked 
to bone health’. The question asked at this point was different: does 
bone regulate physiological functions that, a priori, have nothing to do 
with bone health? 

Although this hypothesis was initially formulated on inspection 
of mice deficient in the osteocalcin gene, it was the study of another 
gene and of another mouse model that shaped the concept into the 
form of a genetic pathway. Embryonic stem-cell phosphatase (ESP) 
is an obscure protein tyrosine phosphatase without any known sub- 
strate that is expressed in only three cell types: the embryonic stem 
cell, the Sertoli cell of the testes and the osteoblast’*. This restricted 
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Figure 2 | Osteocalcin, a bone-derived multifunctional hormone. 
Undercarboxylated osteocalcin stimulates insulin secretion and B-cell 
proliferation in the pancreas, energy expenditure by muscle, and insulin 
sensitivity in adipose tissue, muscle and liver. In addition, it promotes male 
fertility by stimulating testosterone synthesis in Leydig cells of the testis 
through the activation of its receptor, GPRC6A, in these cells. 
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pattern of expression justified the study of its function in vivo, and 
for that purpose Esp was deleted either in all cells or only in osteo- 
blasts. The observation that both mouse models developed the same 
phenotypes implied that the biological functions they reveal take 
place in osteoblasts”. 

The absence of Esp in osteoblasts prompted perinatal death of 
mice, resulting from severe hypoglycaemia. This happened because 
the functions of this intracellular tyrosine phosphatase are, through 
its expression in osteoblasts, to inhibit the expression and secretion 
of insulin by pancreatic B-cells, to limit insulin sensitivity in liver, 
muscle and white adipose tissue, and to decrease energy expendi- 
ture’. All of the phenotypes of Esp-deficient mice mirrored those 
observed in mice deficient in the osteocalcin gene, and genetically 
Esp inhibits osteocalcin’s functions'’. The fact that wild-type, but 
not osteocalcin-gene-deficient, osteoblasts induce insulin secretion 
from isolated islets established that osteocalcin is a hormone pro- 
moting B-cell proliferation and insulin expression and secretion. It 
also increases insulin sensitivity and energy expenditure’ (Fig. 2). 

Before considering other aspects of osteocalcin biology one needs 
to address the relevance of these findings to human physiology. To 
the best of our knowledge no molecule identified as a hormone in 
the mouse has lost this function in humans. Although no mutation 
in osteocalcin or its receptor has yet been reported, this assumption 
seems to extend to osteocalcin, which has become an increasingly 
accepted biomarker of insulin resistance in human studies**””. 

Because most hormones have several functions, the next question 
was whether this was the case for osteocalcin. This was even more 
relevant because osteocalcin-gene-deficient mice breed poorly. A 
study of this phenotype revealed that osteocalcin promotes testoster- 
one synthesis by Leydig cells of the testis and fertility in male mice™. 
For that purpose osteocalcin upregulates, in testes but not in ovaries, 
the synthesis of enzymes necessary for testosterone biosynthesis, 
without affecting the expression of the gene encoding Cyp19 (the 
enzyme converting testosterone to oestradiol). This male-specific 
function of osteocalcin implied that its putative receptor (OSTR) 
may be expressed only in testes. This sexual dichotomy allowed 
the identification of GPRC6A, a G-protein-coupled receptor, as an 
osteocalcin receptor, that is expressed in mice and humans in Leydig 
cells of the testis, but not in follicular cells of the ovary’* (Fig. 2). A 
pioneering study had previously proposed that GPR6CA could be an 
osteocalcin receptor, although no binding experiments were done”. 


Osteocalcin puts bone in the thick of things 

If the notion that bone affects energy metabolism and reproduction 
was implicit in the overall hypothesis, what emerged next — that bone 
is an endocrine hub on which several hormones converge to recruit 
osteocalcin as their ultimate relay — was not expected. To appreciate 
this notion, which was an outcome of the study of the relationship 
between ESP and osteocalcin, one needs to understand how osteoc- 
alcin is modified post-translationally. 

Osteocalcin is carboxylated on three glutamic acid residues, a 
modification that confers on proteins a high affinity for minerals 
such as the hydroxyapatite crystal present in the mineralized bone 
matrix’’, Nevertheless, a small but measurable proportion of under- 
carboxylated osteocalcin is found in the serum”, indicating that 
either some osteocalcin is secreted in an incompletely carboxylated 
form by osteoblasts or that osteocalcin becomes decarboxylated 
outside the cell. This was an important question to address because 
cell-based and in vivo investigations had shown that it is the under- 
carboxylated form of osteocalcin (the one in which Glu 13 is not 
carboxylated) that is active on B-cells and Leydig cells'*”*. 

The search for substrates of ESP identified the insulin receptor 
as such a molecule”. The Esp-null mouse, therefore, is a gain-of- 
function model of insulin signalling in osteoblasts. Subsequently, two 
groups working independently showed that mice lacking the insulin 
receptor in osteoblasts (InsR,,, ’ mice) were glucose intolerant and 
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insulin insensitive when fed on normal chow; that is, they were a 
phenocopy of the osteocalcin-gene-deficient mice*”’. Because mice 
lacking the insulin receptor in skeletal muscle or white adipose tissue 
do not display glucose intolerance when fed on a normal diet**™*, 
insulin must act in additional tissues to achieve glucose homeostasis. 
The fact that bone is such a tissue legitimizes the notion that this tis- 
sue is necessary for glucose homeostasis. In addition, InsR,, mice 
had significantly less biologically active (undercarboxylated) osteo- 
calcin in their sera, revealing that insulin signalling in osteoblasts is 
a determinant of osteocalcin bioactivity*’™. In a manner that is both 
elegant and economical, insulin uses the interplay between osteo- 
blasts and osteoclasts for that purpose. Specifically, insulin inhibits 
the expression in osteoblasts of the gene encoding osteoprotegerin 
(Opg)”', which hampers osteoclast differentiation. In other words, 
insulin signalling in osteoblasts favours bone resorption, a process 
that occurs at pH 4.5 (ref. 55). Acidic pH is the only mechanism 
known to achieve decarboxylation of proteins”, therefore, bone 
resorption decarboxylates and activates osteocalcin®’. Thus, ina 
feedforward loop, insulin signalling in osteoblasts promotes its own 
secretion by activating osteocalcin (Fig. 3), and mice and humans 
in which bone resorption is genetically impaired show a decrease 
in the undercarboxylated form of osteocalcin, resulting in glucose 
intolerance”. The functional equivalent of ESP in human osteoblasts 
is protein tyrosine phosphatase 1B (PTP-1B), a tyrosine phosphatase 
previously known for its ability to inactivate the insulin receptor in 
other cell types”. 

If insulin enhances osteocalcin activity, which in turn favours 
insulin secretion, how do vertebrates prevent hypoglycaemia? The 
answer is leptin”. Young ob/ob mice, before they become obese, 
have no molecular or metabolic evidence of insulin resistance, yet 
they are markedly hyperinsulinaemic and hypoglycaemic, indicating 
that a function of leptin is to inhibit insulin secretion’. As is true of 
its other functions, leptin by and large does not act locally on islets 
to affect insulin secretion, but through a neuronal relay®’. Because 
the sympathetic nervous system acting through Adrb2 expressed 
in osteoblasts mediates the inhibition of bone mass accrual by lep- 
tin, it was tempting to ask whether the same pathway might inhibit 
insulin secretion by modulating osteocalcin expression or activity. 
Molecular evidence showed that the sympathetic tone mediates the 
upregulation of Esp expression by leptin and that ob/ob mice have 
more active osteocalcin in their circulation”. Three pieces of genetic 
evidence demonstrate that leptin regulation of osteocalcin activity 
occurs through sympathetic signalling in osteoblasts. First, mice 
lacking Adrb2 in osteoblasts are hypoglycaemic and hyperinsuli- 
naemic. Second, mice lacking one copy of the leptin gene and one 
copy of Adrb2 in osteoblasts are also hyperinsulinaemic and hypo- 
glycaemic. Third, ob/ob mice that lack the osteocalcin gene have 
normal expression of the insulin genes and are normo-insulinaemic 
for significantly longer than ob/ob mice”. 

Altogether, the metabolic functions of osteocalcin and their 
regulation by leptin and insulin reveal how intertwined energy metabo- 
lism and bone (re)modelling are. They also underscore the importance 
of the interplay between osteoblasts and osteoclasts for glucose metab- 
olism and indicate that the regulation of osteocalcin activity occurs at 
both the transcriptional and post-translational levels. 


Energy metabolism, bone mass and bone diseases 

Energy metabolism is a multistep process initiated by food absorption 
along the gastrointestinal tract. Thus, in considering whether there is 
coordinated regulation of bone mass and energy metabolism one needs 
to determine whether any part or function of the gastrointestinal tract 
influences the accrual of bone mass. This is important in view of the fact 
that the gastrointestinal tract is an endocrine organ secreting several 
hormones whose functions are not all known. Clinically there is reason 
to ask whether the gastrointestinal tract affects bone mass: osteoporosis 


is often seen in patients with inflammatory bowel diseases”. 
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Figure 3 | A feedforward loop links insulin, bone resorption and 
osteocalcin activity. Insulin signalling in osteoblasts decreases the expression 
of Opg. The decrease in the ratio of osteoprotegerin (OPG) to receptor 
activator of nuclear factor-«B ligand (RANKL) increases bone resorption by 
osteoclasts. The acidic pH (4.5) in resorption lacunae decarboxylates (that is, 
activates) osteocalcin (GLA-OCN) stored in the bone extracellular matrix. 
Undercarboxylated active osteocalcin (GLU13-OCN) then stimulates insulin 
secretion by the B-cells of the pancreatic islets and promotes insulin sensitivity 
in peripheral organs. ECM, extracellular matrix; InsR, insulin receptor. 
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The study of mouse models of two human genetic diseases, the 
Coffin-Lowry syndrome (a learning disability syndrome with low 
bone mass) and the skeletal manifestations of neurofibromatosis 
1 (NF1), established the existence of a connection between food 
absorption and the accrual of bone mass. In these disorders, bone 
mass abnormalities are due to a decrease or an increase, respectively, 
in the activity of activating transcription factor 4 (ATF4), a tran- 
scription factor that favours bone formation, in part by promoting 
amino acid import into osteoblasts®*. One could, therefore, put 
this function of ATF4 to work to correct the skeletal manifestations 
of Coffin-Lowry syndrome or NF1 in the mouse by modulating 
protein content in their diets® (Fig. 4). Another study looked at 
the influence of the gastrointestinal tract on mineral metabolism. 
Given that bone is a mineralized tissue and that the mineral crystal 
in the bone is made of calcium and phosphate, it is not surpris- 
ing that significant variations in calcium or phosphate metabolism 
affect bone mass. One reason for that is that a high extracellular 
concentration of calcium inhibits secretion by parathyroid glands of 
parathyroid hormone (PTH), whose physiological role is to increase 
bone resorption®. A study relying on mouse genetics and human 
pathology showed that the high acidity (low pH) present in the 
stomach, the entry point of the gastrointestinal tract, is required for 
proper calcium absorption and, therefore, for normal PTH secretion 
and bone resorption” (Fig. 4). This established a simple connec- 
tion between the gastrointestinal tract and bone mass that could be 
exploited for therapeutic purposes. These examples of an influence 
of the gastrointestinal tract on bone mass raised the possibility that 
hormones made by the gastrointestinal tract regulate bone mass. A 
peculiar combination of scientific frustration and luck showed that 
this was not only the case, but also that this regulation is potentially 
important for the treatment of the most frequent bone degenerative 
disease, osteoporosis. 

The cell-surface molecule Lrp5 (low-density-lipoprotein receptor 
related protein 5), despite having no identifiable ligand and no sig- 
nalling pathway, is remarkably important in bone biology for medical 
reasons. First, LRP5 is mutated in two human diseases. In osteopo- 
rosis pseudoglioma (OPPG), a disease characterized by the appear- 
ance of osteoporosis several years after birth, it is inactivated”; in 
high bone mass syndrome (HBM) there is a missense mutation in 
LRP5 that is thought to be a gain-of-function mutation”. Patients 
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Figure 4 | Interactions between the gastrointestinal tract and bone mass. 
Stomach acidity (low pH) is required for the proper absorption of calcium 
(Ca**) and is, therefore, essential to maintain normal levels of serum calcium. 
Serum calcium, in turn, negatively regulates secretion from the parathyroid 
gland of PTH, a hormone that stimulates osteoclast differentiation and 

bone resorption. Bone resorption by osteoclasts also occurs at low pH and 
contributes to the maintenance of serum calcium. Peripheral serotonin is 
produced by the duodenum and inhibits bone formation by osteoblasts, 
whereas dietary intake of amino acids (proteins) favours collagen synthesis by 
osteoblasts. 


harbouring the latter mutation have no detectable symptoms before 
adulthood, again arguing against a developmental defect. In fact, 
the main symptom is that post-menopausal female patients with 
HBM do not develop osteoporosis”. Another reason to draw much 
attention to Lrp5 is that it affects bone mass only by acting on bone 
formation”. In other words, the Lrp5-dependent signalling pathway 
holds the key to an anabolic treatment for osteoporosis, the ultimate 
objective for the most frequent bone degenerative disease. 

Lrps are known to be able to bind multiple ligands; nevertheless, 
because of its homology to Arrow” (a co-receptor in Drosophila 
for Wingless, a homologue of Wnt) it is as a potential co-receptor 
for Wnt proteins that Lrp5 has been studied for the longest time. 
This was a reasonable assumption to test, and cell culture experi- 
ments showed repeatedly that Lrp5, like other Lrps, can function as 
a Wnt co-receptor in vitro and in controlling eye vascularization’. 
However, when it came to its role in bone, there were discrepancies 
between expectations and results obtained through the study of ani- 
mal models. A few examples follow. Lrp5-null osteoblasts proliferate 
poorly in vivo, but normally in cell culture”. Because Lrp5 is a recep- 
tor, this observation is inconsistent with the notion that its signalling 
is affected in osteoblasts. Furthermore, microarray analyses showed 
that genes regulated by Lrp5 (cell-cycle regulators) and by the 
canonical Wnt signalling (Opg) in bone specimens were different”. 
Accordingly, in mice, inactivation of the canonical Wnt signalling 
pathway in osteoblasts or osteocytes does not affect bone formation 
but bone resorption instead’”*”. In humans, patients with HBM who 
have been followed up for more than 20 years have not yet developed 
bone tumours (M. Kassem, personal communication), even though 
many Wnt family members are aggressive oncoproteins”. 

In an unforeseen turn of events, attempts to understand these 
discrepancies have brought serotonin back into the picture. While 
studying the role of brain-derived serotonin through the inacti- 
vation of Tph2, the function of Tph1, the enzyme responsible for 
peripheral serotonin synthesis, was investigated as a negative con- 
trol. Tph1" mice had already been generated when it was real- 
ized that the most overexpressed gene in the absence of Lrp5 was 
in fact Tph1 (ref. 76). From being an internal control in another 
project Tph1 became, overnight, the topic of an independent one. 
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Two groups have generated mouse models of Lrp5 deletion in a cell- 
specific manner and have obtained different results”. The reason 
for this discrepancy is unknown. Because we ourselves generated 
one of these models, we believe it would be inappropriate to use the 
tribune provided by this review to make our case. It is also unneces- 
sary, because, as important as mouse genetics is, it remains a sur- 
rogate for human genetics, and so far all patients with OPPG or 
HBM for whom measurements of circulating serotonin have been 
published have shown high and low circulating levels of serotonin, 
respectively’, Similarly, serotonin-producing tumours cause 
osteoporosis®’, and patients with osteoporosis have high circulating 
serotonin levels** (Fig. 4). As a result, the regulation of bone mass by 
serotonin has now acquired a (medical) life of its own and deserves 
to be studied because it holds great promise for the anabolic treat- 
ment of osteoporosis. This is even more true given that two proof-of- 
principle studies have shown that an inhibitor of serotonin synthesis 
in the gut, shown to be safe in humans in a phase I clinical trial®”**, 
cured bone disease in Lrp5-null mice and gonadectomy-induced 
osteoporosis in rodents. In both cases this was achieved through a 
purely anabolic mode of action, just as Lrp5 affects bone mass*”””. 


Perspective 

The most appropriate way to look forward is to formulate the 
questions that will identify the next frontiers in this aspect of physi- 
ology. For instance, are all of the functions of osteocalcin known? 
What are the genes downstream of osteocalcin in all of its target 
cells? Is osteocalcin the only bone-derived hormone affecting energy 
metabolism? Is there a counterpart of osteocalcin in females that 
affects their reproduction? We will also venture a general and hum- 
bling statement: if we have learned so much in so brief a time about 
the physiology of one organ, it is likely that there are many aspects 
of whole-organism physiology to be discovered through the use of 
model organisms. m 
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The Amazon basin in transition 
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Agricultural expansion and climate variability have become important agents of disturbance in the Amazon basin. 
Recent studies have demonstrated considerable resilience of Amazonian forests to moderate annual drought, but they 
also show that interactions between deforestation, fire and drought potentially lead to losses of carbon storage and 
changes in regional precipitation patterns and river discharge. Although the basin- wide impacts of land use and drought 
may not yet surpass the magnitude of natural variability of hydrologic and biogeochemical cycles, there are some signs of 
a transition to a disturbance-dominated regime. These signs include changing energy and water cycles in the southern 


and eastern portions of the Amazon basin. 


Amazon basin for many thousands of years, but expansion 

and intensification of agriculture, logging and urban footprints 
during the past few decades have been unprecedented. The human 
population of the Brazilian Amazon region increased from 6 million 
in 1960 to 25 million in 2010, and the forest cover for this region has 
declined to about 80% of its original area’. Efforts to curb deforestation 
have led to a steep decline in forest clearing in the Brazilian Amazon, 
from nearly 28,000km* yr’ in 2004 to less than 7,000km* yr * in 
2011'. However, this progress remains fragile. The river system 
produces about 20% of the world’s freshwater discharge’, and the forest 
biomass holds about 100 billion tonnes of carbon (C; refs 3, 4), which is 
equivalent to more than 10 years’ worth of global fossil-fuel emissions. 
Maintaining the biotic integrity of the biome and the ecosystem services it 
provides to local, regional and global communities will require improved 
understanding of the vulnerability and resilience of Amazonian eco- 
systems in the face of change. 

Here we provide a framework for understanding the linkages between 
natural variability, drivers of change, responses and feedbacks in the 
Amazon basin (Fig. 1). Although the basin-wide carbon balance remains 
uncertain, evidence is emerging for a directional change from a possible 
sink towards a possible source. Where deforestation is widespread at 
local and regional scales, the dry season duration is lengthening and wet 
season discharge is increasing. We show that the forest is resilient to 
considerable natural climatic variation, but global and regional climate 
change forcings interact with land-use change, logging and fire in com- 
plex ways, generally leading to forest ecosystems that are increasingly 
vulnerable to degradation. 


H umans have been part of the vast forest-river system of the 


Natural and anthropogenic climatic variation 

Changes in Amazonian ecosystems must be viewed in the context of the 
natural variation in climate®® and soils’ across the region, as well as 
natural cycles of climatic variation and extreme events. A climatic 
gradient spans the Amazon basin (Fig. 2), from the continuously rainy 


northwest to the wet/dry climate and long dry season of the southern 
and eastern regions, including the Cerrado (woodland/savannah) in the 
southeast. This climatic gradient is largely coincident with a gradient in 
land-use change, with more conversion to agriculture in the drier eastern 
and southern regions, indicating the interconnectedness of biophysical 
and socio-economic processes. 

The El Nifio/Southern Oscillation (ENSO) profoundly affects rainfall 
in the Amazon basin’, especially the eastern portion; there is decreased 
flow of the Amazon River and some of its major tributaries during El 
Nifo years, and increased flow and increased flooding during La Nifa 
years®. The ENSO effect is superimposed over a 28-year cycle of vari- 
ation in precipitation*® such that the biggest floods occur when La Nifia 
coincides with the wet phase in the 28-year cycle; this coincidence last 
occurred in the mid-1970s (Fig. 3). The worst droughts occur when El 
Nifio coincides with the dry phase of the longer-term cycle, such as the 
1992 drought. The North Atlantic Oscillation (NAO) also affects the 
region, contributing to, for example, the 2005 drought, which resulted in 
the lowest river levels recorded until then in southern and western 
tributaries*. Although much has been learned about extreme events 
and decadal-scale cycles, no discernable long-term trend has yet been 
identified in the total discharge of the Amazon River’. 


Forests are resistant to seasonal droughts 

The ability of roots to access deep soil water’? and to redistribute it’? 
helps to maintain evergreen canopies during dry seasons, demonstrating 
the adaptation of Amazon forest species to seasonal drought. The com- 
bination of access to deep soil water and less cloudiness permits con- 
tinued plant photosynthesis throughout most of the dry season’. 
However, transitional forests and Cerrado ecosystems, where mean 
annual precipitation is less than 1,700 mm and the dry season lasts for 
=4 months, show clear evidence of dry season declines in evapotran- 
spiration and therefore potential water stress'*. Many tree species in the 
Amazon and Cerrado produce a flush of new green leaves near the end 
of every dry season, which is often detected in satellite images as an 
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Figure 1 | Interactions between global climate, land use, fire, hydrology, ecology and human dimensions. Forcing factors are indicated with red ovals; 
processes addressed in this Review are indicated by green boxes and arrows; and consequences for human society are indicated by blue boxes with rounded corners. 
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Figure 2 | Climatic gradient across the Amazon basin. Main figure, the 

hydrologic Amazon basin is demarcated by a thick blue line; isopleths of mean 
daily precipitation during the three driest months of the year” (in mm; white 
lines) are overlain onto four land-cover classes”*”’ (key at bottom left). These 
isopleths are presented only for areas within Brazil, because of lack of adequate 
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data elsewhere. The arrow emphasizes the trend from continuously wet 
conditions in the northwest to long and pronounced dry seasons in the 
southeast, which includes Cerrado (savannah/woodland) vegetation. National 
boundaries are demarcated by broken black lines. Inset, map showing area of 
main figure (boxed). 
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Figure 3 | Decadal and seasonal variation in flood area. The long-term 
record of simulated interannual variation in percentage deviation from mean 
flood area (a) illustrates how the ENSO events are superimposed over a 28-year 
cycle of high- and low-precipitation phases'”’. Recurrent seasonal patterns of 
flooded area (10° km’) for five selected years (b) are superimposed over the 
interannual variation. Striped bars, La Nina years; stippled bars, El Nino years; 
solid bars, non-ENSO years. 


increase in vegetation indices that involve ratios of red and near-infrared 
reflectance'*’*. The relation between these satellite-based indices of 
seasonal greenness and ecosystem productivity remains an unresolved 
focus of debate in these studies, but in any case, this response represents 
a short-term phenomenon. 


Multi-year or extreme drought 
Experimental manipulations and observations of permanent forest plots 
address responses to multi-year and extreme drought. Two long-term 
drought experiments have produced remarkably similar results, demon- 
strating that adaptation to seasonal drought can be overwhelmed by 
multi-year drought'®’’”. These studies demonstrated a physiological 
adaptation of the trees, which maintained a relatively constant water 
tension in the xylem (isohydry) in both wet and dry seasons; but this 
adaptation may eventually lead to mortality when roots are unable to 
extract enough soil water during multi-year droughts'*. After diverting 
35-50% of total rainfall for three years using below-canopy panels and 
gutters, plant-available soil moisture stores became depleted, wood pro- 
duction declined by about 30-60%, tree mortality nearly doubled, and 
live above-ground biomass decreased by about 18-25% (refs 16, 17). 
Mortality rates increased to nearly three times that in the control plot 
during years 4-7 of rainfall exclusion”. 

The severe 2005 drought in the southwestern Amazon—when dry 
season temperatures were 3-5 °C warmer than normal and rainfall over 
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the SolimGes River basin was only 33-65% of average values*—may have 
exceeded the adaptive capacity of many forest species. Analysis of 51 
long-term monitoring plots across Amazonia showed that, relative to 
pre-2005 conditions, most forest plots subjected to increased water 
deficit in 2005 lost several tons of living tree biomass carbon per hectare, 
owing to a marginally non-significant decline in growth and a significant 
increase in mortality of trees’”. A similarly severe but more extensive 
drought occurred again in 2010, affecting more than half of the basin 
and resulting in the lowest discharge ever recorded at Manaus”. 
Susceptibility to drought is likely to vary regionally, depending on the 
climate (total precipitation and its seasonal distribution) and soil water 
storage properties (texture and depth) to which the existing vegeta- 
tion types (for example, Cerrado woodlands, tall-statured central 
Amazon forests, and transition forests) are physiologically adapted. 
Furthermore, there is evidence that certain taxa are more vulnerable to 
drought-induced mortality'’*. Despite this regional variability, the 
observations of natural droughts and the drought manipulation experi- 
ments indicate similar trends of mortality in response to dry season 
intensity”. 


Land-use change and regional climate 


Land use is changed to capture agricultural and forestry revenues, and 
results in trade-offs with multiple ecosystem services, such as C storage, 
climate regulation, hydrologic balance and biodiversity (Fig. 1). 


The drivers of deforestation 

Road paving is one of the economic activities that stimulates deforesta- 
tion”. Further clearing occurs along networks of ‘unofficial roads’ that 
result from the interacting interests of colonist farmers and loggers”; 
loggers minimize their costs by buying the right to log private lands. 
Although practices vary widely across the region, most small land 
holders (<200ha) have kept more than 50% of their land in some 
combination of mature and secondary forest”. 

International and national demands for cattle and livestock feed are 
increasingly driving land-use change. Direct conversion of forest to 
cropland in 2003, mostly by large land holders, represented 23% of 
the deforestation in forest and Cerrado regions of the state of Mato 
Grosso”’. Although cattle pasture remains the dominant use of cleared 
land, the growing importance of larger and faster conversion to crop- 
land, mostly for soybean export, has defined a trend of forest loss in 
Amazonia since the early 2000s. 

Although selective logging is not an immediate land-use change, it 
often leads to deforestation. From 1999 to 2003, the area annually logged 
in the Amazon basin was similar in magnitude to the area deforested”. 
Logged areas are accessible by logging roads and are likely to be cleared 
within only a few years after initial disturbance”, and those that are not 
cleared havea high risk of burning”. On the other hand, reduced-impact 
logging has been demonstrated to be economically viable, while causing 
only modest and transient effects on carbon storage and water 
exchange*’. Expansion of protected areas has also played an important 
role in reducing deforestation in the Brazilian Amazon (Fig. 4)°’. 


Deforestation alters the energy balance 

Incoming air from the Atlantic Ocean provides about two-thirds of the 
moisture that forms precipitation over the Amazon basin**. The remainder 
is supplied through recycling of evapotranspiration, primarily driven by 
the deep-rooted Amazon trees. 

A large number of observational and modelling studies have sug- 
gested that deforestation causes two main changes in the energy and 
water balance of the Amazon basin, as follows. First, partitioning of the 
net radiation that is absorbed by the land surface changes, with a 
decrease in the latent heat flux and an increase in the sensible heat flux, 
primarily because deforestation results in less vegetation being available 
to transpire water to the atmosphere. Second, replacing the dark rainforest 
with more reflective pasturelands or crops results in a decrease in solar 
radiation absorbed by the land surface. Reforestation can reverse these 
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Figure 4 | The Amazon basin today and future fire risks. a, Protected areas 
and major planned infrastructure. b, The risk of fire by 2050”° under business- 
as-usual deforestation and climate change scenarios”. The numbers in 


trends. Within a few years of pasture abandonment, regrowing 
Amazonian forests establish rates of evapotranspiration and reflectivity 
that are close to those measured in mature forests, even though they have 
not yet recovered the biomass and species diversity of a mature forest***». 

Atmospheric convection and precipitation are driven by the fluxes of 
energy and water from the land surface. Where clearings for cattle pastures 
extend tens of kilometres outward from a road, the air above the deforested 
areas warms up more quickly and tends to rise and draw moist air from the 
surrounding forest, creating so-called ‘vegetation breezes’. This decreases 
rainfall over the forest while increasing cloudiness, rainfall and thunder- 
storms over the pasture*®. Heterogeneous deforestation at large scales 
(hundreds to thousands of km?) leads to more complex circulation 
changes, with suppressed rainfall over core clearings, particularly at the 
beginning and the end of the wet season, and unchanged or increased 
rainfall over large remnant forest patches*”**. These changes also affect 
water and light availability, and the C uptake of the remaining forests, but 
those effects are not yet well quantified. 

At deforestation scales greater than 10° km’, numerical models con- 
sistently suggest that a significant decrease in basin-wide precipitation will 
occur’? due to: (1) a decrease in the evapotranspiration from deforested 
regions and resultant downwind transport of water vapour; and (2) a 
decrease in net absorbed solar energy and a consequent general weakening 
of the continental-scale low-pressure system that drives precipitation over 
the basin. 


Deforestation, climate and river discharge 

Taken alone, a decrease in regional precipitation would result in 
decreased discharge. However, the integrated response of a river system 
depends on the balance between precipitation and evapotranspiration 
effects (Fig. 1). Deforestation within a particular watershed would cause 
reduced evapotranspiration and increased discharge, but deforestation 
at the continental scale could cause reduced regional precipitation and a 
tendency towards decreased river discharge”. 
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a indicate the following cities: 1, Cuiaba; 2, Porto Velho; 3, Manaus; 4, Belém; 5, 
Rio Branco; 6, Paramaribo; 7, Georgetown; 8, Iquitos; 9, Puerto Maldonado; 10, 
Santa Cruz de la Sierra. 


A large disturbance and a long data record are needed to detect 
unambiguously the effect of deforestation on the discharge from a large 
river, given the large interannual and decadal variation in precipitation. 
For most of the major tributaries of the Amazon River, the area deforested 
is not yet large enough to be able to attribute changes in discharge spe- 
cifically to deforestation. Similarly, a temporal trend in sediment load 
could not be distinguished from highly variable interannual and seasonal 
variation for the Madeira River, which drains the southwestern Amazon 
basin*°. However, for the Tocantins River’ and Araguaia River** systems, 
which drain parts of the Cerrado and rainforest environments in the 
southeastern Amazon, the relative contributions of climate variability 
and deforestation have been teased apart. From 1955 to 1995, the area 
of pasture and cropland in the Tocantins basin increased from about 30% 
to 50% and annual river discharge increased by about 25%, but changes in 
precipitation were not statistically significant. Changes of the same mag- 
nitude have occurred in the Araguaia River since the 1970s, and sediment 
load increased by 28% with deforestation. In both rivers, discharge 
increased mostly during the wet season, when flooding risks are greatest. 
If deforestation approaches this magnitude in other tributaries, it is likely 
that land-use change will enhance flooding and sediment transport. 


Regional climate change 

The IPCC fourth assessment climate change model runs show the highest 
probability of significant precipitation decrease predicted for southeastern 
Amazonia, where deforestation is greatest and where the climate and 
ecosystems transition from short-dry-season rainforest to long-dry- 
season savannah ecosystems**“*, Various global and regional climate 
modelling approaches have suggested that once deforestation exceeds 
about 40% of the entire Amazon basin, a ‘tipping point’ might be passed”, 
whereby decreased energy and moisture released to the atmosphere from 
the largely deforested landscape would result in reduced convection and 
precipitation, and a shift in the forest-savannah boundary or large-scale 
dieback of rainforest. 
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A number of deficiencies in the structure and application of global 
climate models suggest that the uncertainty of these simulated outcomes 
and estimated tipping points is very high. A few examples include: (1) 
many of the models simulate too little precipitation in the Amazon owing 
to incomplete representation of the role of the Andes in continental 
circulation and large-scale convection over the core of the western 
Amazon, and also owing to coarse representation of the land surface with 
respect to small-scale meteorological processes”; (2) inter-annual vari- 
ation of sea surface temperatures in the tropical Pacific and tropical north 
Atlantic Ocean are closely associated with extreme flood and drought 
events in the Amazon**”°, but these teleconnections to the Amazon are 
not yet adequately represented in global climate models; and (3) the 
biophysical response of vegetation to increasing atmospheric COs, 
including effects on evapotranspiration, may be one of the largest 
unknowns for the future of the Amazon forests. The probability of simu- 
lated forest dieback due to decreased rainfall is greatly reduced when a 
strong CO, fertilization response is included in a vegetation model™, but 
the scale of the actual impact of increasing CO, on photosynthetic effi- 
ciency remains a large source of uncertainty. 

In summary, the changes in precipitation and discharge associated 
with deforestation already observed in the southern and eastern 
Amazon demonstrate a potential for significant vegetation shifts and 
further feedbacks to climate and discharge. Numerical models strongly 
suggest that potential future deforestation may also cause feedbacks to 
large-scale climate and vegetation distribution, but the models have 
deficiencies that prevent confident prediction of the magnitude or spatial 
distribution of deforestation that would lead to a significant region-wide 
decrease in precipitation—including whether a threshold, or tipping point, 
exists whereby the basin could slip into a dry, stable state. Focusing on a 
theoretical and difficult-to-define tipping point for the entire basin may 
divert the scientific community from the important large-scale regional 
changes that are already taking place, such as lengthening of the dry 
season’””* and increases in river discharge*!” in ecologically and agricul- 
turally important transition zones of the eastern and southern flanks of the 
basin. 


Fire as cause and consequence of change 


The probability of fire is clearly affected by climate and land use, the 
latter providing the majority of ignition sources today”. Fire also affects 
regional climate through a complex set of biophysical and socio- 
economic feedback processes (Fig. 1). 


Smoke changes cloud physics and rainfall 

During the wet season, the air over most of the Amazon region is as 
pristine as air over the open ocean—only a few hundred aerosol particles 
per cm’ of air‘”—inspiring the term ‘green ocean”. In stark contrast, 
burning for land clearing, pasture management and charcoal produc- 
tion, and escaped forest fires during the dry season, increase aerosols to 
more than 40,000 particles per cm’ of air in some regions“”. This smoke 
and haze affects the microphysical processes within clouds that deter- 
mine how droplets are formed, making droplets too small to precipitate 
as rain, thus reducing local rainfall and increasing cloud lifetime”. The 
water vapour remaining in the atmosphere ascends to higher altitudes, 
where it invigorates thunderstorm formation and lightning strikes, but 
not necessarily rain. During the dry season, satellite-based measure- 
ments of aerosol optical depth were inversely correlated with precipita- 
tion®’. In addition to locally smoke-inhibited rainfall, fires cause further 
plant stress due to ozone pollution®’ and thick haze that reduces light 
availability and photosynthesis*. Generally, plants are most productive 
with some scattered light at intermediate levels of aerosol thickness, but 
conditions during the biomass burning season often exceed this 
optimum”. 


Drought increases fire susceptibility 
The tall, dense tree canopy of central Amazonian forests creates a humid 
microclimate at ground level, which naturally protects the forest from 
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fire’. However, several lines of evidence indicate that this natural res- 
istance may be changing: (1) about 39,000 km? of Amazon forest burned 
during the El Nifo drought of 1998”, including intact, closed-canopy 
forests; (2) both logging and drought-induced tree mortality allow sun- 
light to penetrate clearings in the canopy, which dries out the forest 
floor, rendering it more flammable”; (3) after a forest is burned once, 
it is more likely to burn again, because a burned forest dries out more 
easily; and (4) ignition sources have also increased owing to pasture 
management and charcoal making**. Although Brazil has made great 
strides in recent years to reduce rates of deforestation’, the frequency of 
fire has not decreased**, and prospects for continued forest degradation 
resulting from fires escaping nearby agricultural areas may be a growing 
risk in many regions. 


Fires alter forest characteristics 

After fires sweep through Amazonian forests, tree mortality ranges from 
8% to 64% of mature stems (=10 cm diameter at breast height)*°. More 
frequent and/or more severe fires tend to increase tree and liana 
mortality’. Big trees are generally better adapted to surviving fire, but 
tend to be the first to suffer from drought'””*”’. Although surviving stems 
can benefit from the initial pulse of fire-released nutrients and reduced 
competition, fire-induced mortality reduces overall canopy cover, bio- 
mass, and species richness*””*. The decline in plant species diversity also 
reduces the abundance of fruits and invertebrates, thereby changing the 
food supply of birds and other animals*°. Frequent fire could change the 
structure, composition and functioning of vegetation by selecting fire- 
adapted species and favouring more flammable species (for example, 
grasses), thus leading to a more savannah-like ecosystem”. 


Multiple fires retard forest regrowth 

Fire is used as a tool to help clear land for cattle pasture and to slow the 
invasion of woody shrubs, but pastures are often abandoned after a few 
years, when grass productivity declines and weeds can no longer be 
effectively controlled. Despite tremendous diversity in rates of regrowth 
among secondary forests from different regions of the basin, the rate of 
secondary forest regrowth following pasture abandonment was found to 
be negatively correlated with the number of fires that occurred while in 
the pasture phase®. Nitrogen (N) loss during burning alters the natural 
patterns of phosphorus limitation on highly weathered soils. In a study 
of secondary forests growing on abandoned pastures and croplands, 
several soil and foliage indicators of N limitation were strongest in the 
youngest forest stands and became less pronounced as the forests aged*’. 
After decades of forest regrowth, the N cycle gradually recuperates, 
establishing a N-rich mature forest, but the rate of recuperation, as well 
as the rate of forest regrowth, depends, in part, on the legacy of previous 
land use and fire. 


Disturbance effects on greenhouse gases 


Changes in greenhouse-gas emissions due to disturbance processes 
must be placed in the context of natural emissions. Amazonian forests 
and wetlands are significant natural sources of methane and nitrous 
oxide**°° (Fig. 5). Unfortunately, a net carbon balance for the region 
remains elusive. 


Mature forests may be accumulating carbon 

Repeated sampling of about 100 permanent plots in the RAINFOR 
network scattered across nearly all Amazonian countries indicates that 
mature Amazonian forests have been accumulating carbon at an esti- 
mated rate of 0.4PgCyr ' (1Pg=10'°g; 95% confidence interval 
range of estimate, 0.29-0.57 Pg Cyr _') in the decades before the 2005 
drought"’. The fastest growing trees are in the foothills of the Andes, 
where the soils are generally younger and more fertile’, but where the 
trees are generally smaller and shorter-lived*’. In contrast, the biggest 
and slowest growing trees occur in the oldest and more nutrient-poor 
soils of the lowland central and eastern parts of the basin”. The soils of 
mature forests on highly weathered Oxisols and Ultisols are unlikely to 
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Figure 5 | Estimates of Amazonian greenhouse-gas emissions. Estimates of 
annual, basin-wide greenhouse-gas fluxes described in the text are presented 
together here, in a common currency of Pg CO3-equivalents, using 100-year 
global warming potentials for CO, (black), CH, (red) and N2O (purple). Owing 
to large uncertainties, all values are rounded to one significant figure, and even 
these estimates remain subject to debate. Where no estimate is available, “??” is 
indicated. Note that dissolved inorganic carbon (DIC) and dissolved organic 
carbon (DOC) may be transported via groundwater and overland flow from 
upland forests to streamside (riparian) forests, and that CO can be lost 
(evasion) from river water to the atmosphere. 


be significant C sinks”, although more study is needed on a wider 
diversity of soils. 

The cause of observed biomass increases in mature Amazonian forests 
remains unknown. Plausible explanations include a rebound from pre- 
vious human or natural disturbances” or a change in resources that limit 
plant productivity, such as atmospheric COs, soil nutrients, or light 
availability due to changes in radiation, climate and cloudiness”. The 
RAINFOR network is our best indicator of Amazonian above-ground 
biomass change, but the network is neither a systematic nora randomized 
sampling of Amazonian vegetation. Because the network has relatively 
few small plots covering a vast region, the effects of large-scale natural 
disturbances over decadal and longer timescales may not be included 
in the sampling network, leading to an overestimate of a biomass 
increase®’'. Although this challenge to the RAINFOR conclusions has 
been rebutted”””’, resolving the issue will require empirical data on the 
distribution of natural disturbances, which is still poorly known’'. A 
recent analysis of satellite images and meteorological data showed that 
large disturbances (>5 ha) caused by windstorms are rare, with a return 
interval of about 40,000 years (ref. 74), suggesting that such disturbance 
effects may not be common enough to undermine extrapolations of 
carbon uptake rates from the RAINFOR network. However, more work 
on this topic is needed, including better estimates of the return intervals of 
smaller disturbances (<5 ha)’!. 

Full C accounting should also include exports from forests to aquatic 
systems. The river water is supersaturated with dissolved CO, which is 
eventually released to the atmosphere at an estimated rate of about 
0.5 PgCyr * (ref. 75). Estimates of the sources of this C remain poorly 
constrained—about two-thirds may come from leaf and wood detritus 
dropped into the river from flooded forests, with about one-third pro- 
duced by aquatic plants (mats of grasses and other macrophytes) within 
the river, and a small fraction by algae’’. Additional possible sources 
include particulates washed in with soil particles and dissolved organic 
and inorganic C in ground water’””*. We know very little about the C 
budget of flooded forests and riparian zone forests, which probably 
contribute significant terrestrially fixed C to streams and rivers’®. 

Estimates of CO, fluxes based on year-round vertical profiling of 
atmospheric CO) concentrations by aircraft are available now only for 
part of the eastern Amazon. Fire emissions roughly cancel a modest 
biological sink during the dry season, so that a wet season source yields 
an annual net source of C to the atmosphere”’. This result is consistent 
with ground-based estimates of slow growing trees and a concentration 
of land-use change in the eastern part of the basin. It remains to be seen if 
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future aircraft measurements will corroborate the C sink inferred from 
scattered ground measurements in the more intact forests of the western 
part of the basin. 


Disturbing forests causes net C and N loss 

The net effect of Amazonian deforestation and reforestation results in an 
annual net C source of 0.15-0.35 Pg C (ref. 80). Adding C emissions 
from fire and logging extends the range to an annual net release of 
0.2-0.8 PgC (ref. 80). The estimated mean annual C emission from 
deforestation and burning of Cerrado is 0.07 PgC for 2003 to 2008". 
These estimates are improving, in part because of advances in the tech- 
nology for analysing satellite images*’ to combine spatially explicit 
deforestation rates with regionally specific estimates of forest-C stocks*’. 

Pyrogenic CH, emissions from conversion of Amazonian and Cerrado 
native vegetation to pasture are about 1.0 and 0.4Tg CH, yr’, respec- 
tively*', but this does not include shifting cultivation or wildfire. Annual 
pyrogenic N,O emissions from conversion of Amazonian forest to pasture 
are about 0.01 Tg N as N2O (ref. 81), but this does not include shifting 
cultivation or wildfire. 

When forests are replaced by cattle pastures, they can either gain** or 
lose** soil C. Losses are more common where soil C stocks are initially 
large, and gains are more common when management inputs (fertilizer, 
herd rotation, overgrazing avoidance) are greatest**. However, changes 
in soil C stocks are usually dwarfed by much larger losses in tree bio- 
mass. In contrast, the sparse and short-statured trees of the Cerrado 
have less above-ground biomass than an Amazonian forest, but the C 
stocks in roots and soil organic matter of the Cerrado (100 Mg Cha ' in 
the top 1m of soil) can be 2-7 times higher than the above-ground 
stocks*’. Well-managed cultivated pastures may provide enough C 
inputs to maintain soil C®**, but most pastures in the Cerrado region 
are in advanced stages of degradation, where C inputs are too low to 
sustain high soil C storage. 

Amazonian upland forest soils annually take up about 1-3 Tg of CHy, 
and pasture soils are probably a small net annual source of <0.1 Tg CH4 
(ref. 65). A significant net emission of CH, in upland forests has been 
measured, which might include termites or anaerobic respiration in water- 
logged wood, soil, bromeliads, or moss patches, but the source remains 
unknown™. Enteric fermentation by cattle is estimated to emit 2.6 and 
4.1TgCHyyr ' in Amazonian and Cerrado regions, respectively*’. 
Continuing studies point to major hydroelectric reservoirs as an increasing 
source of methane”. On the basis of chamber flux measurements, upland 
Amazonian forest soils are estimated to emit 1.3Tgyr ' of N,O-N 
(ref. 65), which is about 15% of global non-anthropogenic emissions. 
Young cattle pastures have higher N.O emissions compared to forests, 
but old pastures have lower emissions, so the net effect of deforestation has 
been a small annual decrease of <0.1 Tg N2O-N (ref. 65). 

Although secondary forests may be significant carbon sinks in other 
parts of the world®, they currently contribute little to the net C balance 
of the Amazon basin, because they are frequently re-cut before they 
grow large enough to store much C”. Indeed, the area of secondary 
forests is declining where agriculture continues to expand and intensify, 
leading to continued loss of biomass-C from those regions”’. 
Agroforestry and other alternatives to slash-and-burn agriculture for 
smallholders have not been widely adopted, but the potential for sig- 
nificant C sequestration per hectare and the techniques of nutrient 
management in these systems have been demonstrated”””’. 


Emerging evidence for a transition 


Are impacts of land-use and climate change in the Amazon basin sur- 
passing the natural variability of climate, greenhouse-gas emissions, and 
cycles of carbon, nitrogen and water? Thanks to increased research in 
this area, including the Large-scale Biosphere-Atmosphere (LBA) 
experiment in Amazonia (see the accompanying World View in this 
issue for a description of the LBA project), we can answer this question 
for some, but not all consequences of land-use and climate change. For 
greenhouse gases, the answer is probably ‘not yet’ with respect to CH, 
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and N,O, because they remain dominated by large emissions from 
undisturbed wetlands and soils, respectively, but the answer for CO, 
is more complex (Fig. 5). Although a C budget for the basin remains 
uncertain, deforestation has moved the net basin-wide budget away 
from a possible late-twentieth-century net C sink and towards a net 
source. This directional change is consistent with recent results of 
inverse modelling based on the TransCom3 network of CO, measure- 
ments, which reports a shift from a sink in the 1980s to a source in the 
2000s for the tropical Americas”*. Much of the Amazon forest is resilient 
to seasonal and moderate drought, but this resilience can and has been 
exceeded with experimental and natural severe droughts, indicating a 
risk of C loss if drought increases with climate change. The forest is also 
resilient to initial disturbances, but repeated or prolonged disturbance 
changes forest structure and nutrient dynamics, potentially leading to a 
long-term change in vegetation composition and C loss. A combination 
of regional net flux estimates based on aircraft campaign measurements 
with ground-based studies that elucidate process-level understanding is 
needed to narrow uncertainties. 

With respect to energy and water cycles, at least two of the large river 
basins on the southeastern flanks of the Amazon forest that also drain 
the more heavily deforested Cerrado region—the Tocantins and 
Araguaia basins—have experienced increases in wet season discharge 
and sediment load. Evidence for changes in temporal and spatial patterns 
of precipitation, such as extended length of the dry season, is emerging at 
local and regional scales. We cannot yet answer the questions of whether 
total precipitation has changed or whether recent severe droughts and 
other extreme events are clear indicators of patterns expected to persist. 
Narrowing uncertainties about the effects of deforestation on regional 
precipitation, temperature and fire risk will require combining realistic 
spatial patterns of deforestation and degradation with improved mesoscale 
circulation models of climate. 

The emerging evidence of a system in biophysical transition high- 
lights the need for improved understanding of the trade-offs between land 
cover, carbon stocks, water resources, habitat conservation, human health 
and economic development in future scenarios of climate and land-use 
change****>°*, Brazil is poised to become one of the few countries to 
achieve the transition to a major economic power without destroying 
most of its forests. However, continued improvements in scientific and 
technological capacity and human resources will be required in the 
Amazon region to guide and manage both biophysical and socio- 
economic transitions. 
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Retinoblastoma is an aggressive childhood cancer of the developing retina that is initiated by the biallelic loss of RB1. 
Tumours progress very quickly following RBI inactivation but the underlying mechanism is not known. Here we show 
that the retinoblastoma genome is stable, but that multiple cancer pathways can be epigenetically deregulated. To 
identify the mutations that cooperate with RBI loss, we performed whole-genome sequencing of retinoblastomas. 
The overall mutational rate was very low; RB1 was the only known cancer gene mutated. We then evaluated the role 
of RBI in genome stability and considered non-genetic mechanisms of cancer pathway deregulation. For example, the 
proto-oncogene SYK is upregulated in retinoblastoma and is required for tumour cell survival. Targeting SYK with a 
small-molecule inhibitor induced retinoblastoma tumour cell death in vitro and in vivo. Thus, retinoblastomas may 
develop quickly as a result of the epigenetic deregulation of key cancer pathways as a direct or indirect result of RB1 loss. 


Retinoblastoma is a rare childhood cancer of the retina that can 
develop in a sporadic or a heritable form and is fatal if untreated. 
When the RBI gene was first cloned, it was found to undergo biallelic 
inactivation in virtually all retinoblastoma tumours’. Since then, 
hundreds of genetic lesions have been identified in human cancer. 
These genetic lesions can be grouped on the basis of the signalling 
pathways they affect that have direct or indirect mechanistic links to 
many of the common cellular properties, or hallmarks, of cancer. 
Thus, the rate of cancer progression is related to the kinetics of 
acquisition of multiple genetic lesions and/or epigenetic changes that 
ultimately lead to activation of growth-signalling pathways, evasion of 
cell death and senescence, acquisition of limitless replicative potential, 
sustained angiogenesis, and local tissue invasion and metastasis”. 
RB1 inactivation confers limitless replicative potential to retinoblasts 
and is rate limiting for retinoblastoma tumorigenesis’. However, the 
mechanisms that enable retinoblastoma cells to acquire the additional 
hallmarks of cancer remain unknown. Evidence from molecular, 
cellular and cytogenetic studies suggest that RB1 is required for main- 
taining chromosomal stability**, and that its loss leads to chromosome 
instability in cells maintained in culture. These data raise the possibility 
that RB1 inactivation may underlie the rapid acquisition of cooperating 
mutations in key cancer pathways through chromosome instability. 
Alternatively, epigenetic changes may have a more dominant role in 
cooperating with the loss of RBJ retinoblastoma tumorigenesis. RB1 
has been implicated in regulating most major epigenetic processes, 
including microRNA regulation, DNA methylation, histone modifica- 
tion and ATP-dependent chromatin reorganization’ °. Thus, in- 
activation of RB1 in retinoblasts may lead to the rapid epigenetic 


deregulation of cancer genes that contribute to the essential cellular 
properties of retinoblastoma. 

In this study, the St Jude Children’s Research Hospital - Washington 
University Pediatric Cancer Genome Project characterized the genetic 
landscape of retinoblastoma. Whole-genome sequencing (WGS) of 
four retinoblastomas and their paired germline DNA samples showed 
no genetic lesions in known tumour suppressor genes or oncogenes, 
other than RBI and MYCN. More importantly, an orthotopic xenograft 
derived from one of the primary tumours showed no evidence of clonal 
variation or new coding-region mutations. This finding suggests that 
the retinoblastoma genome is more stable than previously believed. 

Unlike the genetic landscape of retinoblastoma, the epigenetic pro- 
file shows profound changes relative to that observed in normal reti- 
noblasts. One of the most striking results was the induction of the 
expression of the proto-oncogene spleen tyrosine kinase (SYK) in 
human retinoblastoma. SYK is required for tumour cell survival, 
and inhibition of SYK with a small-molecule inhibitor caused the 
degradation of MCLI1 and caspase-mediated cell death in retinoblas- 
toma cells in culture and in vivo. These findings highlight how com- 
prehensive genetic and epigenetic analyses of tumours can be 
integrated, leading to the discovery of promising new therapeutic 
approaches and shedding light on the mechanisms underlying the 
rapid progression of retinoblastoma following RB1 inactivation. 


Retinoblastoma whole-genome sequencing 

We performed WGS analysis on four primary human retinoblastoma 
samples (Supplementary Information, section 1, and Supplementary 
Table 1) and on matched normal tissue. Local tumour cell invasion, 
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Figure 1 | Characterization of retinoblastomas samples. a-c, Representative 
retinoblastoma tumour section (SJRB001) stained with haematoxylin and eosin 
(H&E), showing choroidal and optic nerve invasion (arrow). d-f, H&E-stained 
section of the SJRB001X orthotopic xenograft with choroidal (e) and optic 
nerve (f) invasion (arrows). AC, anterior chamber; ON, optic nerve; Sc, sclera. 
Scale bars, 25 um. 


but not metastasis, was evident in each patient (Fig. la-c and Sup- 
plementary Fig. 1). We generated an orthotopic xenograft, SJRBO01X, 
of the primary tumour SJRB001 by inoculating primary tumour cells 
into the vitreous humour of the eyes of immunocompromised mice 
(Supplementary Information, section 2). SJRBOO1X had molecular, 
genetic and histopathologic features similar to those of SJRBOO1 
(Fig. 1d-f; Supplementary Figs 2-4; Supplementary Tables 2-4; and 
Supplementary Information, section 3). 

Using a paired-end sequencing approach, we generated 1,040.9 
gigabase pairs (Gb) of sequence data for the samples described; 
956.8 Gb (92%) was successfully mapped to the NCBI 36.1 reference 
genome (Supplementary Information, section 4, and Supplementary 
Table 5). The average genome coverage was 28.9, and the average exon 
coverage was X23.8 with 98.4% of single nucleotide polymorphisms 
(SNPs) detected across all nine genomes showing concordance with 
their corresponding SNP array genotype calls at the same genomic 
positions (Supplementary Table 5). To provide additional sequence 
coverage, we performed transcriptome sequencing of all four primary 
tumours (Supplementary Information, section 5, and Supplementary 
Table 6). 

We identified 668 validated somatic sequence mutations and 40 
structural variations across the four cases (Table 1). These included 23 
tier-1 mutations in genes, 35 tier-2 mutations in evolutionarily con- 
served regions of the genome (Supplementary Information, section 6), 
309 tier-3 mutations in non-repetitive regions of the genome that are 
not part of tiers 1 and 2, and 301 tier-4 mutations in repetitive sequences 
in the genome (Table 1 and Supplementary Table 7). The average 
number of sequence mutations was 167 per case (range, 56-258), with 


only 3.25 mutations per case (range, 0-5) resulting in amino-acid 
changes (Table 1). The estimated mean mutation rate was 6.7 < 10°° 
per base (range, 1.03 X 10° 7~2.17 X 10” *), which is 15-fold less than 
that in adult tumours analysed by WGS, except for acute myeloid 
leukaemia'’. The predominant changes were C—> A and G-> T trans- 
versions (Supplementary Fig. 5), which is consistent with the possibility 
that some of the transversions result from production of 8-oxoguanine 
during oxidative stress. Tumour SJRB002 had no somatic sequence 
variations that resulted in amino-acid changes; the only structural var- 
iations were the loss of heterozygosity (LOH) at the RB1 locus on 
chromosome 13 and a gain of chromosome 6p. This suggests that very 
few genetic lesions are required for retinoblastoma progression after 
RB1 inactivation. 


RBI inactivation in retinoblastoma 


Both RB1 alleles were inactivated in each sample (Supplementary Figs 6 
and 7). Tumours SJRBO02 and SJRBO03 had mutations in RB1 com- 
bined with copy-number-neutral LOH, and SJRBO01 and SJRBO04 
had somatic sequence mutations combined with RB1-promoter hyper- 
methylation (Supplementary Figs 6-8). Deep-sequence analysis of 
the germline sample from SJRBO02 revealed that about 10% of reads 
contained the R445 nonsense mutation, suggesting the presence of 
germline chimaerism for the RB1 mutation (Supplementary Fig. 6). 
Combining the WGS data with SNP array data of an additional 42 
samples, we found that tumours from patients with lower regional 
nucleotide diversity were much less likely to undergo LOH at the RBI 
locus (Supplementary Tables 8 and 9 and Supplementary Information, 
section 7). These data show a significant association (P = 8 Xx 10 8, 
Fisher’s exact test) between a germline genetic variation and mech- 
anism of biallelic RBI inactivation in retinoblastoma. 


Recurrent lesions in retinoblastoma 


To determine whether any of the 11 genes with somatic mutations 
that caused amino-acid changes or a frameshift in the coding region 
(Table 1; Fig. 2a, b; Supplementary Figs 9 and 10; Supplementary 
Information, section 8; and Supplementary Table 10) were recurrently 
mutated in retinoblastoma, we sequenced all exons from the 11 genes 
in our recurrent screening cohort of 42 retinoblastomas (Supplemen- 
tary Information, section 4). Only BCOR was recurrently mutated in 
retinoblastoma (6 of 46, or 13%). Five of the samples had BCOR muta- 
tions that resulted in truncation of the encoded protein, and one 
sample had a focal gene deletion (Fig. 2c, Supplementary Table 11 
and Supplementary Fig. 11). 

Wealso used the WGS data to identify somatic structural variations 
including whole-chromosome gains and losses, focal deletions (DEL), 
insertions (INS), inversions (INV), intrachromosomal rearrange- 
ments (ITX), interchromosomal rearrangements (CTX) and regions 
of LOH (Fig. 2a-c; Supplementary Information, section 9; Sup- 
plementary Table 12; and Supplementary Fig. 12). The average 


Table 1 | Somatic mutations and structural alterations in retinoblastoma 


Sample* Tier 17 Non-silent tier 1 Genes Tier 2|| Tier 34 Tier 4# Total Mutation rate Structural variations** 
SJRBOO1 D,G 7 4 RB1, CCNC, AGMO, 16 117 85 225 1.03 x 10-7 4 
RHBG 
SJRBOO1 X, D8 0) 0) NA 8 68 9 85 5.87 x 10-8 4 
SJRBOO2 D, G 1 0) NA 1 25 29 56 207 X10: 0 
SJRBOO3 D, G z 4 RB1, HNMT, LHX8, 5 67 50 129 5.79 x 10-8 24 
STOML2 
SJRBOO4 D, G 8 5 RB1, CD300LG, SDK1, 13 100 137 258 8.63 x 10° 12 
TXK, DMWD 
Background mutation rate was calculated on the basis of the ratio of tier-3 mutations to tier-3 bases covered at least tenfold in tumour and germline samples for each pair. 
*D, diagnostic tumour sample; G, germline (blood DNA) sample; X, xenograft sample. 


+ Tier 1 mutations are found in genes and include exons, 5’ and 3’ untranscribed regions, and splice sites. Introns are not included. 


£Non-silent tier-1 mutations change amino acids in genes. 


§ All of the somatic mutations in SJRBOO1 D, G were identified in SJRBOO1X. This row highlights the new mutations acquired in the xenograft relative to the primary tumour. 


|| Tier-2 mutations are found in regions of the genome that are conserved between humans and mice. 
Tier-3 mutations are found in regions of the genome that are not evolutionarily conserved. 
#Tier-4 mutations are found in repetitive regions of the genome. 


** Structural variations include focal amplifications and deletions, LOH, and interchromosomal and intrachromosomal translocations. 
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Figure 2 | Genomic profiles of SJRBO01, SJRB002 and SJRBOO1X. 

a, b, CIRCOS plots of genetic alterations in two retinoblastomas and the 
matched orthotopic xenograft. Loss of heterozygosity (orange), amplifications 
(red) and deletions (blue) are shown. Interchromosomal translocations (green 
lines) and intrachromosomal translocations (purple lines) are indicated. 


number of structural variations was ten per case (range, 0-24) 
(Supplementary Table 1). Tumour SJRB001 had four structural var- 
iations (two DEL and ins INS) including a gain of a region of chro- 
mosome 2 spanning MYCN (Supplementary Table 12), and the only 
chromosomal lesion in SJRB002 was a gain of chromosome 6p, which 
occurs in about 40% of human retinoblastomas” (Fig. 2b). Only a few 
genomic regions were affected by the structural variations in SJRB003 
and SJRB004 (Supplementary Figs 10, 13 and 14 and Supplementary 
Table 12). 


Orthotopic retinoblastoma xenograft 


The genomic landscape of the orthotopic xenograft was remarkably 
similar to that of the primary tumour, despite continuous growth and 
multiple passages in vivo over nine months (Fig. 2 and Supplementary 
Table 1). All of the sequence mutations and structural variations 
detected in SJRBOO1 were retained in SJRBOO1X. Only 67 new SNVs 
and four structural variations were identified in the xenograft, and 
none affected annotated genes (Fig. 2 and Supplementary Table 1). 
Moreover, each mutation was identified at a subclonal level (range, 20- 
30%) and the mutant allele frequency for the lesions in tiers 1-4 in 
SJRBOO1 was retained in SJRBOO1X (Supplementary Fig. 15). 

This result was surprising because several studies in mice and cell 
cultures have linked RB1 inactivation to defects in chromosome segrega- 
tion that result in aneuploidy**'*"* and chromosome instability*, We 
measured the distance between sister chromatids, the distance between 
kinetochores and the proportion of lagging chromatids in two RB1- 
deficient human retinoblastoma orthotopic xenografts'® (SJRBOO1X 
and SJRB002X). Consistent with results from RBI-deficient retinal 
pigmented epithelium cells*,; the distances between sister chromatids 
and between kinetochores were increased, and there was evidence of 
lagging chromosomes (Supplementary Figs 16 and 17). However, less 
variation in ploidy was observed during spectral karyotype analysis of 
SJRBOO1X and SJRB002X, which was more consistent with the ploidy of 
wild-type cells (Fig. 3a, b and Supplementary Table 13). Moreover, copy 
number variations were much lower in our cohort of 46 retinoblastomas 
than in tumours with known genome instability such as ovarian cancer 


Sequence mutations in NCBI RefSeq genes included silent single nucleotide 
variants (SNVs; green), missense SNVs (brown), nonsense SNVs (dark blue), 
splice-site mutations (pink) and insertion/deletion mutations (indels, red). 
*Germline mosaic. c, BCOR mutations identified in the discovery and 
recurrency cohort. 


(Fig. 3c). Together, the cytogenetic data and WGS data suggest that the 
genome is stable and that newly acquired lesions do not provide a 
selective growth advantage and are thus probably passenger mutations 
(Supplementary Information, section 9). 


Identifying deregulated cancer pathways 


There are many examples over the past several decades of epigenomic 
changes such as DNA methylation contributing to tumorigenesis'”"’. 
Indeed, a recent study demonstrated changes in DNA methylation in 
Wilms’ tumours’, which tend, like retinoblastomas, to have stable 
genomes. To explore whether epigenetic deregulation of genes or 
pathways promotes tumorigenesis in retinoblastoma, we carried out 
an integrative analysis of chromatin immunoprecipitation (ChIP) 
data, DNA methylation data and gene expression data using order 
statistics. The SJRBOO1X sample was used for ChIP assay (Sup- 
plementary Figs 18-20), and primary tumour and xenograft samples 
were used for both DNA methylation and gene expression assays. In 
all three analyses, experimental results in retinoblastoma tumours 
were compared with those from human fetal retinae. In this com- 
parison, a total of 104 genes, including 15 known cancer genes (Fig. 4a 
and Supplementary Tables 14 and 15), were found to have significant 
differences, indicating that several key cancer genes were epigenetically 
deregulated. 


SYK is a novel therapeutic target 
SYK is the fifth most significant gene identified by the integrative 
analysis and the only upregulated kinase gene (Supplementary 
Table 15 and Fig. 4a). SYK is expressed throughout the haematopoietic 
system, regulates immunomodulatory signalling and has been impli- 
cated in several haematologic malignancies*!*. Small-molecule inhi- 
bitors of SYK have been developed to treat autoimmune disorders”, 
and two of those agents, BAY 61-3606 and R406, have shown efficacy 
in preclinical leukaemia studies****. 

ChIP-on-chip analysis showed increased activating histone modi- 
fications (H3K4me3 and K3K9/14Ac) at SYK’s promoter, and the 
repressive histone marker (H3k9me3) was unchanged. Binding of 
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Figure 3 | Analysis of aneuploidy and chromosome instability in 
retinoblastoma. a, Chromosomal mis-segregation of SJRBOO1X cells after at 
least 21 rounds of cell division is plotted in red. RPE, retinal pigment 
epithelium. b, Representative spectral karyotype image of SJRB001X after the 
third passage in mice. c, Alterations in the 46 retinoblastoma (Rb) cases relative 


RNA polymerase II to the SYK promoter was also increased (Fig. 4b 
and Supplementary Tables 14 and 15). These ChIP-on-chip results 
were validated in independent samples by real-time PCR analysis with 
reverse transcription (Fig. 4c), and we confirmed the increase in SYK 
gene expression (Fig. 4d). SYK protein levels were higher in human 
retinoblastoma orthotopic xenografts and cell lines than in human 
fetal retinae (Fig. 4e). To determine whether SYK is expressed in 
primary human retinoblastomas, we performed immunohistochemistry 
on a retinoblastoma tissue microarray or whole-eye sections. In total, 
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to 153 high-grade serous ovarian cancer (Ov) cases from The Cancer Genome 
Atlas. The median percentage of the genome involved in copy number 
variations (CNVs) was 1.5% for retinoblastoma and 27.7% for ovarian cancer. 
Red circles, samples used for WGS. 


100% (82 of 82) showed very strong expression (3+) of SYK in all 
tumour cells; SYK was not expressed in normal retina (Fig. 4f). SYK’s 
kinase activity is regulated through autophosphorylation at the Tyr 525 
and Tyr 526 residues within its catalytic domain. These sites were 
phosphorylated in retinoblastoma cells, and this phosphorylation 
was reversed by exposure to BAY 61-3606 or R406 (Fig. 4g and data 
not shown). 

To determine whether SYK expression is required for retinoblastoma 
growth, survival or both, we generated a short hairpin RNA (shRNA) 
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Figure 4 | SYK is expressed in retinoblastoma and is required for survival. 
a, Whole-genome view of the gene ranks based on integrating ChIP-on-chip, 
methylation and gene expression results. The y axis shows —log(P), where P is 
the P value of the Q statistic corrected for multiple testing. Significantly (false 
discovery rate, =10%) downregulated (green) or upregulated (red) genes are 
shown. b, ¢, ChIP validation of histone markers of the SYK promoter including 
quantification by quantitative PCR with TaqMan probes. In c, each bar shows 
mean + s.d. of triplicate samples. d, SYK expression measured by quantitative 
PCR in fetal-week-20 retina (fetal), primary retinoblastoma (tumour), 
orthotopic xenografts (SJRBO01X and SJRB002X) and cell lines. Each bar is 
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mean = s.d. of duplicate samples normalized to GPI1 expression. 

e, Immunoblot of SYK (green) and actin (red) in orthotopic xenografts, human 
fetal retina and representative cell lines; black and white representation of the 
SYK immunoblot is in the lower panel. f, H&E (purple) and anti-SYK (brown) 
immunohistochemistry of retinoblastoma tissue. g, Immunoprecipitation 
analysis of SYK and pSYK Y525/526 from Weril retinoblastoma cells. 

h, Viability was measured in triplicate cultures 72 h after infection of 
retinoblastoma cells with a lentivirus vector expressing either a control 
lentivirus or an shRNA against SYK. Scale bars, 10 um. 
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against SYK and cloned it into the lentiviral vector Lenti-SYK-9. Lenti- 
SYK-9 efficiently knocked down SYK in retinoblastoma cell lines (Sup- 
plementary Fig. 21) and drastically increased apoptosis in retinoblastoma 
cells (Fig. 4h and Supplementary Fig. 21). Similar results were obtained 
in vivo using SJRBOO1X (data not shown). We used an empty lentiviral 
vector and a lentiviral vector encoding an SYK shRNA that less effec- 
tively reduced SYK expression (Lenti-SYK-6) as controls. Cell lines 
that do not express SYK (BJ, 293T and uninduced Jurkat cells) were 
used as controls and the Lenti-SYK-9 lentivirus had no effect on the 
growth or apoptosis of the control cells. 

We exposed retinoblastoma cell lines that express high levels of SYK 
(Weril and RB355) to various concentrations of the SYK inhibitors 
BAY 61-3606 and R406 for 72 h and then measured cell viability. Jurkat 
(uninduced) and 293T cells were used as negative controls. Weril and 
RB355 cells were sensitive to both SYK inhibitors, but the Jurkat and 
293 cells were unaffected (Fig. 5a). Transmission electron microscopy 
of retinoblastoma cells treated with the SYK inhibitors showed mor- 
phologic features consistent with cell death and mitochondrial defects 
(Supplementary Fig. 22); this was confirmed by scoring the proportion 
of activated caspase 3” cells (Fig. 5b, c and Supplementary Fig. 22). 
Jurkat cells showed no increase in activated caspase 3* cells after 
treatment with 10 1M R406 or BAY 61-3606 (data not shown). 

The proportions of cells from each line that incorporated EdU were 
similar, suggesting that retinoblastoma’s cell cycle is not affected by 
SYK inhibition (Fig. 5d, e and Supplementary Fig. 22). The effects of 
targeting SYK were partly rescued by the pan-caspase inhibitor 
Q-VD-OPH (Supplementary Fig. 22). Less MitoTracker Red accu- 
mulated in retinoblastoma cells exposed to the SYK inhibitors (Sup- 
plementary Fig. 22), yet treatment of Jurkat cells with either inhibitor 
had no effect on MitoTracker Red accumulation (data not shown). 
Inhibiting SYK in B-CLL cells reduced their autophosphorylation of 
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SYK and destabilized MCL1°*°”, the only anti-apoptotic member of 
the BCL2 family that is upregulated in retinoblastoma (data not 
shown). Both inhibitors reduced autophosphorylation of SYK on 
Y525/526 (Fig. 4g) and reduced MCLI expression coincident with 
apoptosis (Fig. 5f). 

We tested the efficacy of BAY 61-3606 in vivo using our SJRBOOIX 
model*’. The chemotherapy regimen consisted of a single sub- 
conjunctival dose of BAY 61-3606 on day 1 and daily doses of topotecan 
(TPT) on days 1-5 until either six courses (21 days per course) were 
administered or the tumour had progressed and surgical enucleation 
was required (Fig. 5g, h). BAY 61-3606 plus TPT significantly improved 
the outcome (P = 0.003; Fig. 5i), and its efficacy was correlated with an 
increase in activated caspase 3° cells in the treated eyes (Fig. 5j). 
Previous studies using this model have shown that TPT combined with 
subconjunctival carboplatin had no effect on tumour response or out- 
come*’, so the improvement seen here can be attributed to targeting 
SYK. MCLI expression was also reduced in the treated eyes, which is 
consistent with increased apoptosis and targeting of SYK in vivo 
(Fig. 5k). Together, these results suggest that SYK is a promising 
new target for treating retinoblastoma. 


Discussion 

Genome sequencing reveals that retinoblastomas have a relatively 
stable genome. The mutational rate and number of structural varia- 
tions per case that we assessed were among the lowest reported in 
human cancer to date. Moreover, in one example (SJRB002) the only 
non-silent mutation found was in RB1, and only two structural varia- 
tions were detected. The minimal increase in passenger mutations in 
SJRB001X cells, despite prolonged passage, was also consistent with a 
relatively stable genome. These results are surprising because previous 
studies have shown that the functional inactivation of RB1 can cause 
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Figure 5 | Retinoblastoma cells are sensitive to 
SYK inhibitors. a, Dose response for SYK 
inhibitors R406 (red) and BAY 61-3606 (black) in 
RB355 retinoblastoma cells and a negative control 
(Jurkat). Each data point is mean + s.d. of triplicate 
samples. b-e, Immunofluorescence of activated 
caspase 3 (b, c) or EdU (d, e) (red) before and after 
treatment of RB355 cells with R406 or BAY 61- 
3606. A total of 250 cells were scored in duplicate 
for each sample and each treatment condition to 
derive the mean and s.d. Nuclei were stained with 
4’,6-diamidino-2-phenylindole (DAPI, blue). 

f, Treatment of stimulated Jurkat or RB355 cells 
with 5 uM BAY 61-3606 for 24h reduced MCL1 
expression. g, Schematic of the treatment schedule 
for mice with SJRBO01X tumours. 

h, Representative magnetic resonance images of a 
mouse whose tumour responded after four courses 
of treatment with BAY 61-3606 (left) and another 
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genomic instability. Although retinoblastoma cells show the mitotic 
defects that can lead to errors in chromosome segregation, the ploidy 
of retinoblastoma cells is stable in vitro and in vivo’’**. We do not 
exclude the possibility that RB1 inactivation promotes change in other 
contexts, but our data show that genomic instability is not a hallmark 
of retinoblastoma and do not explain how retinoblastomas progress 
so rapidly. 

We propose that epigenetic mechanisms contribute to retinoblastoma 
tumorigenesis. We identified several known oncogenes and tumour 
suppressor genes with histone modifications and altered DNA methyla- 
tion that correlated with changes in gene expression. Our key discovery 
was that SYK is important in retinoblastoma. Retinal progenitor cells 
and retinal neurons express little or no SYK, and SYK has no known 
function in the developing visual system. Moreover, no recurrent genetic 
lesions in SYK were identified by WGS or SNP array analysis to suggest 
that this gene drives retinoblastoma tumorigenesis. Only by integrating 
epigenetic and gene expression analyses did we identify SYK as an 
important oncogene in retinoblastoma. This is important not only for 
expanding our understanding of the biology of retinoblastoma but also 
for advancing immediate therapeutic options that were not previously 
considered, such as the use of BAY 61-3606 or R406. This study high- 
lights the value of integrating WGS analyses of the genetic and epigenetic 
features of tumour genomes to finding a cure for cancers such as retino- 
blastoma. 


METHODS SUMMARY 


Full details of sample acquisition, molecular and biochemical procedures, informatics 
and whole genome sequencing, and animal and drug studies are provided in Sup- 
plementary Information. The St Jude Children’s Research Hospital Institutional 
Review Board approved experiments involving human subjects and informed con- 
sent was obtained from all subjects. For animal studies, all experiments were per- 
formed in accordance with federal guidelines and regulations. The St Jude Children’s 
Research Hospital Institutional Animal Care and Use Committee approved all 
animal experiments. Lentiviral vectors (GIPZ with Lenti-SYK-9 #V3LHS-366147 
and Lenti-SYK-6 #V3LHS-366143) encoding shRNAs against SYK were pur- 
chased from OpenBiosystems. 
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Structure of HDAC3 bound to 
co-repressor and inositol tetraphosphate 


Peter J. Watson’, Louise Fairall!, Guilherme M. Santos! & John W. R. Schwabe! 


Histone deacetylase enzymes (HDACs) are emerging cancer drug targets. They regulate gene expression by removing 
acetyl groups from lysine residues in histone tails, resulting in chromatin condensation. The enzymatic activity of most 
class I HDACs requires recruitment into multi-subunit co-repressor complexes, which are in turn recruited to 
chromatin by repressive transcription factors. Here we report the structure of a complex between an HDAC and a 
co-repressor, namely, human HDAC3 with the deacetylase activation domain (DAD) from the human SMRT 
co-repressor (also known as NCOR2). The structure reveals two remarkable features. First, the SMRT-DAD 
undergoes a large structural rearrangement on forming the complex. Second, there is an essential inositol 
tetraphosphate molecule—p-myo-inositol-(1,4,5,6)-tetrakisphosphate (Ins(1,4,5,6)P4)—acting as an ‘intermolecular 
glue’ between the two proteins. Assembly of the complex is clearly dependent on the Ins(1,4,5,6)P4, which may act as 
a regulator—potentially explaining why inositol phosphates and their kinases have been found to act as transcriptional 
regulators. This mechanism for the activation of HDAC3 appears to be conserved in class I HDACs from yeast to humans, 


and opens the way to novel therapeutic opportunities. 


The acetylation of lysine residues in the tails of histone proteins plays 
an important role in the regulation of gene expression in eukaryotic 
cells’*. The level of lysine acetylation is controlled through the oppos- 
ing actions of histone acetyl transferases (HATs) and HDACs. 
Although chromatin is the best understood substrate for these 
enzymes, lysine acetylation is emerging as a general regulatory mech- 
anism in a diverse array of cellular processes’. 

There are four classes of HDACs in mammalian cells (reviewed in 
ref. 4). Class I HDACs are zinc-dependent enzymes, and include 
HDACI, 2, 3 and 8. Of these, only HDAC8 is a fully functional 
enzyme in isolation®®. HDAC1, 2 and 3 require recruitment into large 
multi-subunit co-repressor complexes for maximal activity’ ’*. These 
co-repressor complexes bring about the repression of gene expression 
when recruited to repressive transcription factors, but also con- 
tribute to the ‘resetting’ of chromatin after rounds of transcriptional 
activation'*'°. 

In recent years, HDACs have become important targets for the 
treatment of a number of cancers’’. Cancer cell lines treated with 
HDAC inhibitors undergo terminal differentiation, growth arrest 
and/or apoptosis. Several HDAC inhibitors are at various stages in 
clinical trials, and two drugs, vorinostat and romidepsin, have been 
approved for the treatment of cutaneous T-cell lymphomas". 

HDAC] and 2 are found in three co-repressor complexes: NuRD”””, 
CoREST”™”? and Sin3A”*”*. In contrast, HDAC3 appears to be uniquely 
recruited to the SMRT complex (or to the homologous NCoR complex) 
where it interacts with a conserved DAD within SMRT or NCoR*?0?7**7°, 
The DAD both recruits and activates HDAC3'°"’. Recruitment of 
HDAC3 to the DAD is essential for repression by certain nuclear recep- 
tors and for the maintenance of normal circadian physiology”””. It has 
been proposed that the assembly of the HDAC3 and SMRT-DAD 
requires a chaperone complex, because when these proteins are 
expressed in bacteria they do not form a complex”®. 

The DAD contains an extended SANT-like domain with an amino- 
terminal DAD-specific motif. Deletion of this motif results in both 
loss of binding and failure to activate HDAC3"'.We have previously 


reported the structure of the isolated DAD from SMRT”’. This 
revealed that part of the DAD-specific motif forms an extra helix that 
is folded against the three helices of the SANT domain to form a four- 
helix bundle. The N-terminal portion of the DAD-specific motif is 
unstructured in solution”’. 

Here we report the structure of HDAC3 in complex with SMRT- 
DAD. This structure not only reveals the specificity and mechanism 
through which SMRT-DAD recruits and activates HDAC3, but also 
identifies Ins(1,4,5,6)P4 as a key component of the complex that has 
the potential to regulate assembly of HDACs with their co-repressors. 


Overall architecture of the complex 

As HDAC3 and SMRT-DAD do not form a complex when expressed 
in bacterial cells, full-length HDAC3 and Flag-tagged SMRT-DAD 
(amino acids 389-480) were expressed in suspension-grown 
mammalian HEK293 cells. The complex remained tightly associated 
during a three-step purification, including size exclusion chromato- 
graphy (Supplementary Fig. 1). Interestingly, at salt concentrations 
higher than 50 mM, the complex dissociated and the HDAC3 pre- 
dominately took the form of aggregate or oligomers (Supplementary 
Fig. 2). Such oligomerization of HDAC3 has also been reported by 
others”. During crystallization, the HDAC3 tail is proteolysed but 
HDAC3 remains bound to SMRT-DAD and retains deacetylase 
activity (Supplementary Fig. 3). Small crystals (15 um; Supplemen- 
tary Fig. 1) diffracted to 2 A and the structure was solved by molecular 
replacement with an HDACS8 structure**. Model building and refine- 
ment yielded an excellent map to 2.1 A resolution with clear density for 
both HDAC3 and SMRT-DAD (Supplementary Fig. 1). 

Overall, the HDAC3 structure is similar to the previously deter- 
mined class I HDAC structures of HDAC8** and HDAC2*, and 
consists of an eight-stranded parallel B-sheet surrounded by a number 
of a-helices. The active site lies at the base of a tunnel leading from the 
surface of the protein. A solvent-exposed tyrosine residue is located 
on the surface of the enzyme immediately adjacent to the active site 
tunnel. This tyrosine is unique to HDAC3, and it seems that this 
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residue might interact with the substrate and possibly contribute to 
substrate specificity (Supplementary Fig. 4). 


Structural rearrangement of SMRT-DAD 


On forming a complex with HDAC3, the N-terminal helix of the 
DAD undergoes a major structural rearrangement (from that seen 
in the NMR structure), such that it no longer forms part of the core 
structure, but lies along the surface of HDAC3, making extensive 
intermolecular interactions (Fig. 1a, b). Along with a further extended 
region, this DAD-specific motif buries a surface of 1,178 A. The 
remaining three-helix bundle resembles a canonical SANT domain, 
and buries a further 1,160 A at the interface with HDAC3. 

The SMRT-DAD interacts with the N-terminal region of HDAC3 
(residues 9-49) that forms helix H1, loop L2, helix H2 and strand S82. 
This region differs between HDAC8 and HDAC3. Indeed, in HDAC3, 
helix H1 is distorted and can only be considered to be a pseudo-helix. 
These differences may explain why HDACS is active in the absence of 
an interacting co-repressor. 


An essential role for Ins(1,4,5,6)P4 
At the earliest stages of refinement, the electron density difference 


map revealed a well-ordered small molecule bound at the interface 
between HDAC3 and the DAD (Fig. 2a). The electron density was 


In solution 


Figure 1 | Structure of the HDAC3-SMRT-DAD complex. a, Interaction of 
SMRT-DAD (green ribbon) with HDAC3 (grey surface). The Ins(1,4,5,6)P4 at 
the interface is shown as a raspberry coloured surface. Side chains in the DAD 
that mediate interaction with HDAC3 and Ins(1,4,5,6)P,4 are shown as sticks. 

b, Structure of the DAD domain in solution (left; PDB code 1XC5) compared 
with that bound to HDAC3 (right); helices are individually coloured to facilitate 
comparison. 
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sufficiently well defined that the small molecule could be readily 
identified as inositol tetraphosphate. During further refinement, 
it could be unambiguously assigned as D-myo-inositol-1,4,5,6- 
tetrakisphosphate (based on the axial orientation of the hydroxyl 
group on carbon 2), referred to here as Ins(1,4,5,6)P4 (Fig. 2b). 

The Ins(1,4,5,6)P, molecule is sandwiched between HDAC3 and 
the DAD, in a highly basic pocket, making extensive contacts to both 
proteins, burying a surface of 407 A? (Fig. 2c, d). HDAC3 and SMRT- 
DAD each contribute five hydrogen bonds and salt bridges to the 
Ins(1,4,5,6)P4 (His 17, Gly 21, Lys 25, Arg 265, Arg 301 and Lys 449, 
Tyr 470, Tyr 471, Lys 474 and Lys 475, respectively) (Fig. 2e). 

It is notable that the Ins(1,4,5,6)P, is sufficiently tightly bound to 
the complex that it is retained through the entire purification process. 
It is also striking that the binding is highly specific for Ins(1,4,5,6)P4, 
as the electron density shows that the ligand is uniquely Ins(1,4,5,6)P4 
rather than a mixture of inositol phosphates. 

A careful examination of the structure suggests that Ins(1,4,5,6)P4 
binding is an essential requirement for the interaction between SMRT 
and HDAC3, acting as a ‘intermolecular glue’ that cements the com- 
plex together. If the Ins(1,4,5,6)P, ligand were not present, then the 
many basic residues on either side of the binding cleft would probably 
prevent interaction through charge repulsion. Indeed, at the base of the 
cleft, the C-nitrogen atoms of SMRT Lys 449 and of HDAC3 Lys 25 are 
just 4.4 A from each other. Consequently, the assembly of the three- 
way SMRT-HDAC3-Ins(1,4,5,6)P, complex appears to be mutually 
interdependent, such that both SMRT-DAD and Ins(1,4,5,6)P4 are 
required for activation of the HDAC3 enzymatic functionality. 

These conclusions are supported by our previous mutagenesis 
study of SMRT-DAD, in which we looked at the effect of mutations 
on both the interaction with, and the deacetylase activity of, HDAC3. 
All mutations that failed to activate HDAC3 also abolished, or sig- 
nificantly impaired, interaction. These include mutations of Lys 449, 
Tyr 470 and Tyr 471, which play a key role in binding Ins(1,4,5,6)P4 
(ref. 31). The requirement for Ins(1,4,5,6)P4 to enable complex forma- 
tion may contribute to the explanation as to why recombinant 
HDAC3 and SMRT-DAD expressed in bacteria fail to interact, as 
bacteria probably do not contain sufficient Ins(1,4,5,6)P4 to support 
complex formation. 

Having discovered that Ins(1,4,5,6)P4 plays a key role in HDAC3 
activation, we asked whether inositol phosphates might contribute to 
the assembly and activation of other class 1 HDAC complexes. Sig- 
nificantly, the residues that mediate interaction with Ins(1,4,5,6)P4 
and the SANT domain from SMRT are conserved in both HDAC1 
and 2, but not in HDACS8 (Fig. 3a), fitting with the observation that 
only HDAC is fully active in isolation. 

Similarly, the co-repressor partners for HDAC1 and 2 (that is, 
MTAI, 2 and 3 and CoREST1, 2 and 3) contain SANT domains that 
are analogous to SMRT-DAD. The key Ins(1,4,5,6)P4 binding residues 
are almost entirely conserved (Fig. 3b). It is likely that the specificity for 
the particular HDAC is conferred by the region N-terminal to the 
SANT domain: that is, the DAD specific motif in SMRT/NCoR and 
the ELM2 domains in the MTA and CoREST proteins. 

Taken together, this suggests that inositol phosphate activation of 
class I HDACs is a general mechanism and, given that the key residues 
are also found in the yeast HDAC, Rpd3, and the Snt1 co-repressor, it 
would appear that this mechanism is also evolutionarily conserved. 


Mechanism of activation of HDAC3 

The HDAC3-SMRT-DAD structure provides insight into the mech- 
anism through which the DAD and Ins(1,4,5,6)P,4 contribute to the 
activation of HDAC3. In the crystal, the active site of HDAC3 resembles 
a product complex (Fig. 4a, b). An acetate molecule (present during 
purification) is bound at the active site, making hydrogen bonds to the 
catalytic zinc and side chains of Ty 298 and His 134. Furthermore, a 
methionine side chain (from an adjacent SMRT-DAD in the crystal 
lattice) is bound in the active site tunnel, mimicking a lysine residue. 
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Figure 2 | Ins(1,4,5,6)P, binding to the HDAC3-SMRT-DAD complex. 


a, A striking feature in the difference electron density map (F, — F, at 30, green 
mesh) observed following molecular replacement. b, Electron density 
corresponding to the Ins(1,4,5,6)P, ligand following refinement (2F, — F, at 
2.250, blue mesh). The Ins(1,4,5,6)P4 is shown as a stick model. c, Electrostatic 
surface representation of the HDAC3-SMRT-DAD complex (red indicates 
negative potential and blue positive). A strikingly basic pocket is located at the 
HDAC3-SMRT-DAD interface (indicated by a dashed green line). The active 
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Figure 3 | Sequence conservation suggests that class I HDACs, from yeast to 
man, require inositol phosphates for assembly and activation. a, Alignments 
of key regions of class I HDACs from Homo sapiens (Hs) and Saccharomyces 
cerevisiae (Sc). Residues that mediate interaction with Ins(1,4,5,6)P4 and 
SMRT-DAD are highlighted in blue and red, respectively. Other conserved 
residues are highlighted in grey. b, Alignment of SANT domains from known 
interaction partners for class I HDACs. Residues that mediate interaction with 
Ins(1,4,5,6)P4 and HDAC3 are highlighted in blue and red, respectively. Green 
arrows indicate residues that impair HDAC3 recruitment and activation when 
mutated to alanine*’. 
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site pocket of HDACS3 is indicated by a yellow cross. The Ins(1,4,5,6)P4 is not 
shown for clarity. d, Ins(1,4,5,6)P4 (shown as sticks) binding in the basic pocket 
at the HDAC3-SMRT-DAD interface. e, Detailed interactions of Ins(1,4,5,6)P4 
(sticks) with HDAC3 (blue) and SMRT-DAD (grey). Side chains from SMRT 
and HDAC3 which make contacts with the Ins(1,4,5,6)P, are labelled. The 
phosphate groups of the Ins(1,4,5,6)P4 are labelled P1-6. Hydrogen bonds and/ 
or salt bridges are indicated by dashed lines. 


The binding surfaces for DAD and Ins(1,4,5,6)P4 are located to one 
side of the HDAC3 active site, involving pseudo helix H1 and loops L1 
and L6 (Fig. 4a—c). We propose that changes in both conformation 
and dynamics occur when DAD and Ins(1,4,5,6)P4 bind to HDAC3, 
and that these facilitate substrate access to the active site, resulting in 
enhanced enzyme activity. 

There is a key interaction between Ins(1,4,5,6)P, and Arg 265 in 
loop L6 (coloured orange in Fig. 4b). This loop seems to be very 
important for access to the active site, because Leu 266 forms one wall 
of the active site tunnel, and in the absence of the Ins(1,4,5,6)P, this 
loop is likely to be relatively mobile. 

Comparison of the HDAC3 structure with that of HDAC8, which 
does not require activation by complex formation, reveals that 
HDACS8 differs significantly in the region where HDAC3 interacts 
with SMRT-DAD and Ins(1,4,5,6)P4 (Fig. 4d). In HDAC8, the equi- 
valent helix to pseudo helix H1 in HDAC3 has a regular helical struc- 
ture, loop L1 is two amino acids shorter and loop Lé6 contains a proline 
residue that partly orientates the loop away from the active site 
(Fig. 4d). We suggest that together these differences give substrate 
better access to that active site of HDAC8 than would be possible in 
the uncomplexed HDAC3. The pattern of crystallographic temper- 
ature factors for the various structures supports this interpretation 
(Supplementary Fig. 5). 

To test the importance of Arg 265 as well as loops L1 and L6, we 
co-expressed mutant HDAC3 constructs with SMRT-DAD in 
mammalian cells. Strikingly, all the designed mutations resulted not 
only in total loss of deacetylase activity (Fig. 5a) but also abolished 
interaction with SMRT-DAD (Fig. 5b). These findings support our 
interpretation of the importance of these residues in the activation of 
HDAC3. 

To support the conclusions that Ins(1,4,5,6)P, is essential for 
HDAC3 activity and to test the specificity for Ins(1,4,5,6)P4, we 
sought to establish an in vitro reconstitution assay. We expressed 
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Figure 4 | Mechanism of activation of HDAC3 by binding SMRT-DAD and 
Ins(1,4,5,6)P4. a, SMRT-DAD (grey cartoon) and Ins(1,4,5,6)P4 bind adjacent 
to the HDAC3 (charged surface representation) active site. Acetate and a 
methionine (lysine mimic) are located in the active site. b, Details of the 
HDAC3 active site. Key residues and loops are labelled—see text for details 
(hydrogen bonds and/or salt bridges are indicated by dashed lines). ‘YH1, 
pseudo helix H1. ¢, Pseudo helix H1 and loops L1 and L6 are shown in blue on 
the surface of HDAC3 (grey). These regions are influenced or stabilized by 
binding of SMRT-DAD (green) and Ins(1,4,5,6)P4 (shown as sticks). 

d, Comparison of the structures of HDAC3 and HDACS8. Regions of significant 
difference are coloured blue (HDAC3) and red (HDACS8). Helix 1 is indicated. 


and purified HDAC3, Flag-tagged at the carboxy terminus, from 
mammalian cells and incubated the anti-Flag resin with bacterially 
expressed SMRT-DAD and various concentrations of Ins(1,4,5,6)P4, 
Ins(1,4,5)P3 and Ins(1,2,3,4,5,6)P.¢ (Fig. 5c). HDAC activity was very 
sensitive to inositol phosphate concentration and higher concentra- 
tions do not support reconstitution. This can be explained through 
high levels of free inositol phosphate competing with complex forma- 
tion. This is analogous to moderate monovalent salt concentrations 
causing complex dissociation during purification (Supplementary 
Fig. 2). 

A similar pattern of activation was seen using both Ins(1,4,5)P3 and 
Ins(1,2,3,4,5,6)P., but in both cases a tenfold higher concentration 
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Figure 5 | Exploring the role of Ins(1,4,5,6)P, in complex assembly and 
HDAC3 activation. a, Effect of HDAC3 mutations on deacetylase activity of 
complexes with SMRT-DAD. Residues are mutated to their equivalents in 
HDACS. Loop 1 mutation (L1-mut) is H17C/G21A/K25]. Loop 6 mutation 
(L6-mut) is R264P/L265M. ‘Mut-all’ is H17C/G21A/K251/R264P/L265M/ 
R301A. b, Immunoblots showing that HDAC3 mutations perturb interaction 
with SMRT-DAD. Recombinant proteins (Myc-DAD, HDAC3-Flag) and the 
IgG heavy chain are indicated. c, Deacetylase assays of HDAC3-Flag 
reconstituted with bacterially expressed SMRT-DAD in the presence of various 
inositol phosphates. IP4, IP3 and IP6 are Ins(1,4,5,6)P4, Ins(1,4,5)P3 and 
Ins(1,2,3,4,5,6)P., respectively. d, Synthesis pathway for Ins(1,4,5,6)P.4 from 
phosphatidylinositol-(3,4,5)-trisphosphate (PtdIns(3,4,5)P3) and 
phosphatidylinositol-(4,5)-bisphosphate (PtdIns(4,5)P.). Yeast Arg 82 
converts Ins(1,4,5)P3 to Ins(1,4,5,6)P4. In mammals, both IPMK and PTEN are 
required to make Ins(1,4,5,6)P4. 


was required to achieve a similar level of HDAC activity to that 
seen with Ins(1,4,5,6)P,. This supports the interpretation that 
Ins(1,4,5,6)P,4 is the physiologically relevant assembly partner. 


Does Ins(1,4,5,6)P4 regulate HDACs? 

The finding that Ins(1,4,5,6)P4 is essential for the assembly of class 1 
HDAC repression complexes raises the question as to whether it is a 
signalling molecule with a role in regulating complex assembly, or if it 
is simply an essential structural cofactor. 

The fact that Ins(1,4,5,6)P4 remained bound during purification 
would seem to suggest that it is a structural cofactor. However, to 
retain an intact complex it was necessary to use low ionic strength 
buffers. At physiological ionic strengths, the Ins(1,4,5,6)P4 is likely to 
dissociate readily and thus could regulate complex assembly. 

Importantly, several studies have previously implicated 
Ins(1,4,5,6)P4 as a regulator of gene expression and chromatin 
remodelling—although no molecular mechanism was identified. 
Yeast Arg82 is a transcriptional regulator of genes involved in 
arginine metabolism. It has been shown” that Arg 82 is an inositol 
phosphate kinase that converts Ins(1,4,5)P3 to Ins(1,4,5,6)P4 and that 
kinase activity is required for its role in transcriptional regulation 
(Fig. 5d). Several further studies have supported the importance of 
Arg 82 kinase activity in transcriptional regulation and chromatin 
remodelling” *’, together suggesting that Ins(1,4,5,6)P, has an 
important regulatory role in yeast. 

IPMK, the mammalian homologue of Arg 82, has been reported 
to phosphorylate Ins(1,4,5)P; to form Ins(1,3,4,5,6)P, and then 
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Ins(1,3,4,5,6)P5 (refs 40, 41). Consequently, in mammalian cells, a 
phosphatase is required to generate Ins(1,4,5,6)P4. The most likely 
enzyme is the well-known tumour suppressor gene PTEN*’, which 
is known to be active in the nucleus and to play a role in chromosome 
stability (reviewed in ref. 44). It is tempting to speculate that loss of 
HDAC complex function might be one of the routes through which 
PTEN mutations contribute to oncogenesis. 

A final important question is whether the levels of Ins(1,4,5,6)P4 
are regulated in the cell. This is not easy to establish, as regulation 
could be temporally and/or spatially compartmentalized. However 
there is some evidence that both PTEN and InsP, levels change with 
progression through the cell cycle*. It is also possible that changing 
levels of Ins(1,4,5,6)P4, may contribute to the circadian regulation of 
HDAC3 activity*®. Further research will be needed to clarify these 
important questions. 


Discussion 


We present the structure of a histone deacetylase in complex with its 
activating co-repressor. The structure reveals a striking structural 
rearrangement of the co-repressor and an unexpected inositol 
tetraphosphate molecule (Ins(1,4,5,6)P4) acting as an ‘intermolecular 
glue’, contributing to the stabilization and activation of HDAC3. 

Sequence conservation suggests that Ins(1,4,5,6)P, plays a key 
role in co-repressor assembly and activation of class I HDACs from 
yeast to humans and is likely to act as a regulator of HDAC complex 
assembly. Although it remains to be established how the availability of 
Ins(1,4,5,6)P4 might be controlled, the requirement of Ins(1,4,5,6)P4 
for co-repressor HDAC assembly presents novel opportunities for 
therapeutic intervention that may complement existing HDAC inhi- 
bitors. It may be possible to develop molecules that target the 
Ins(1,4,5,6)P, binding site itself, but it may also be possible to target 
the enzymes responsible for Ins(1,4,5,6)P4 synthesis. 


METHODS SUMMARY 


HDAC3 and Flag-tagged SMRT-DAD (residues 389-480) were expressed using 
transient transfection in suspension grown HEK293F cells. The HDAC3-SMRT- 
DAD complex was purified by Flag affinity chromatography, followed by TEV 
protease cleavage and size exclusion chromatography. Crystals of the HDAC3- 
DAD complex were grown by sitting drop vapour diffusion with a 10% propan-2- 
ol precipitant. The structure was determined by molecular replacement based on 
HDACS (PDB code 3EW8). Ligands, including Ins(1,4,5,6)P4, zinc, potassium, 
acetate and glycerol were added during the refinement process. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Protein expression, purification and crystallization. The DAD domain (SMRT 
389-480) and full length HDAC3 were cloned into pcDNA3 vector. The DAD 
domain construct contained an N-terminal 10XHis-3xFlag tag and a TEV 
protease cleavage site. HEK293F cells (Invitrogen) were co-transfected with both 
constructs using polyethylenimine (PEI) (Sigma). To transfect cells, 0.25 mg 
DNA total was diluted in 25 ml of PBS (Sigma) and vortexed briefly; 1 ml of 
0.5 mg ml! PEI was added, and the suspension was vortexed briefly, incubated 
for 20min at room temperature, then added to the cells. (Final density was 
1 X 10° cells per ml in a volume of 250 ml.) For larger volumes, multiple flasks 
were used. Cells were harvested 48 h after transfection and lysed by sonication in 
buffer containing 50 mM Tris pH 7.5, 100 mM potassium acetate, 5% v/v glycerol, 
0.3% v/v Triton X-100, and Roche complete protease inhibitor (buffer A); the 
insoluble material was removed by centrifugation. The lysate was pre-cleared 
using Sepharose 4B (Sigma) and the complex was then bound to Flag resin 
(Sigma), washed three times with buffer A, three times with buffer B (50 mM 
Tris pH 7.5, 300 mM potassium acetate, 5% v/v glycerol) and three times with 
buffer C (50 mM Tris pH 7.5, 50 mM potassium acetate, 5% v/v glycerol, 0.5 mM 
TCEP). The complex was eluted from the resin by overnight cleavage at 4 °C with 
TEV protease in buffer C. The eluted protein was further purified by gel filtration 
on a Superdex $200 column (GE Healthcare) in buffer containing 25 mM Tris/ 
HCl pH 7.5, 50 mM potassium acetate, 0.5 mM TCEP. The purified complex was 
concentrated to 7.5 mg ml! for crystallization trials. 

Crystals were grown by sitting drop vapour diffusion at 4°C using 0.1M 
HEPES pH 7.5, 0.2M NaCl and 10% v/v propan-2-ol. Crystals were cubic in 
nature, grew to a final dimension of 15 X 15 X 15 1m and belong to space group 
C222). 

Comparison of a fresh protein sample with protein from within the crystal- 
lization drops (after 3 months) by SDS-PAGE showed the presence of a truncated 
form of HDAC3 (Supplementary Fig. 2). Analysis by liquid chromatography- 
tandem mass spectrometry showed that this HDAC3 was truncated at the C 
terminus to residue Q376. 

Structure determination. Crystals were flash-frozen in mother liquor containing 
40% glycerol as a cryoprotectant. Diffraction data were collected on a single 
crystal in two 45° wedges at the Diamond synchrotron microfocus beamline 
124 and processed using XDS”’. The structure was solved by molecular replace- 
ment using HDAC8 (PDB code 3EW8)”? as a search model in Phaser®. Initial 
model building was performed with ARP/wARP (http://www.embl-hamburg.de/ 
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ARP/), which was able to automatically build 95% of the HDAC3 protein chain, 
and two helices from the DAD. The additional HDAC3 and DAD sequences were 
fitted following multiple rounds of refinement and building using REFMAC and 
Coot°. F, — F, density consistent with the ligand, zinc/potassium ions and 
acetate/glycerol molecules observed during the refinement/model building pro- 
cess were fitted and refined as they became apparent. The final model contains 
amino acids 2-370 from chain A and 2-370 from chain B of HDAC3, and amino 
acids 408-476 from chain C and 408-475 from chain D of the DAD. The model 
also contains two Ins(1,4,5,6)P4 molecules, two zinc ions, four potassium ions, 
two acetate molecules and four glycerol molecules. The final model has 97.8% 
residues in the favoured region, 2.0% in the allowed region and 0.2% in the outlier 
region of the Ramachandran plot. 

HDAC activity assays. C-terminally Flag-tagged HDAC3 and Myc-tagged DAD 
were co-expressed in HEK 293 cells as described above. Cells were lysed in 50 mM 
Tris pH 7.5, 50 mM potassium acetate, 5% v/v glycerol, 0.3% v/v Triton X-100 and 
Roche complete protease inhibitor. In order to standardise the assay, 800 1g total 
protein was bound to 40 pil Flag resin (Sigma) for 2h at 4°C, then washed four 
times with lysis buffer. HDAC activity was measured using the HDAC Assay Kit 
(Active Motif) and read using a Victor X5 plate reader (Perkin Elmer). 
Reconstitution assays. C-terminally Flag-tagged HDAC3 was expressed in HEK 
293 cells and purified as described above. Hisg-tagged SMRT-DAD was expressed 
in Escherichia coli strain Rosetta (Novagen) and initial purification was carried 
out using Nickel NTA agarose (Qiagen) followed by gel filtration chromato- 
graphy using a Superdex-S200 26/60 column (GE Healthcare) in buffer contain- 
ing 50 mM Tris/HCl pH 7.5 and 50 mM potassium acetate. 40 jl of HDAC3-Flag 
resin was mixed with His-DAD, with or without phosphoinositide as required, in 
a final volume of 1 ml of buffer D (50 mM Tris/HCl pH 7.5, 50 mM potassium 
acetate, 5% glycerol, 0.3% v/v Triton X-100), and incubated for 2 h at 30 °C. The 
resin was then washed extensively with buffer D, and HDAC activity was mea- 
sured using the HDAC Assay Kit (Active Motif) and read on a Victor X5 plate 
reader (Perkin Elmer). 
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The mass function of dwarf satellite galaxies that are observed 
around Local Group galaxies differs substantially from simula- 
tions’ based on cold dark matter: the simulations predict many 
more dwarf galaxies than are seen. The Local Group, however, may 
be anomalous in this regard®’. A massive dark satellite in an early- 
type lens galaxy at a redshift of 0.222 was recently found® using a 
method based on gravitational lensing”, suggesting that the mass 
fraction contained in substructure could be higher than is pre- 
dicted from simulations. The lack of very low-mass detections, 
however, prohibited any constraint on their mass function. Here 
we report the presence of a (1.9+0.1) X 10°Mo dark satellite 
galaxy in the Einstein ring system JVAS B1938+ 666 (ref. 11) at a 
redshift of 0.881, where Mg denotes the solar mass. This satellite 
galaxy has a mass similar to that of the Sagittarius” galaxy, which is 
a satellite of the Milky Way. We determine the logarithmic slope of 
the mass function for substructure beyond the local Universe to be 
1.1*9 8, with an average mass fraction of 3.3*}$ per cent, by com- 
bining data on both of these recently discovered galaxies. Our 
results are consistent with the predictions from cold dark matter 
simulations’*’° at the 95 per cent confidence level, and therefore 
agree with the view that galaxies formed hierarchically in a 
Universe composed of cold dark matter. 

The gravitational lens system JVAS B1938+ 666 (ref. 11) has a bright 
infrared background galaxy at redshift z= 2.059 (ref. 16), which is 
gravitationally lensed into an almost complete Einstein ring of diameter 
~0.9 arcsec by a massive elliptical galaxy at redshift z = 0.881 (ref. 17). 
This bright, highly magnified Einstein ring makes this system an 
excellent candidate in which to search for surface brightness anomalies 
caused by very low-mass (dark matter) substructure in the halo around 
the high-redshift elliptical lens galaxy. The presence of a low-mass 
substructure (for example a luminous or dark satellite galaxy) in the 
lens galaxy can introduce a localized perturbation of the arc surface 
brightness distribution. Owing to the multiplicity of the gravitationally 
lensed images that form these arcs, these surface brightness ‘anomalies’ 
can be analysed using a pixelated-lens modelling technique and used to 
gravitationally detect and quantify the total mass and position of the 
substructure, down to masses as low as ~0.1% of the mass of the lens 
inside the Einstein radius”'°®. The lens system was imaged at 1.6 and 
2.2 um using the Near Infrared Camera 2 on the W. M. Keck 10-m 
telescope in June 2010. The adaptive optics system was used to correct 
the incoming wavefront for the blurring induced by the atmosphere, 
providing a nearly diffraction-limited point spread function with a full- 
width at half-maximum of ~70 mas. Further details of the data sets and 
their image processing can be found in Supplementary Information. 

A smooth parametric model for the lens potential was constrained 
by using the surface brightness emission from the Einstein ring of each 
data set independently, having first removed the smooth light contri- 
bution from the lensing galaxy. We represented the mass model using 
an ellipsoidal power law, p(r) « r ’, where p(r) is the combined 
luminous and dark matter density as a function of the ellipsoidal 


radius, r. The best-fitting model was then fixed and further refined 
using local potential corrections defined on a regular grid, which are 
translated into surface density corrections using the Laplace operator. 
We found for both the 1.6- and the 2.2-um adaptive optics data sets 
that there was a significant positive density correction, which indicated 
the presence of a mass substructure (Fig. 1 and Supplementary 
Information). Directly from the pixelated potential correction, we 
measured a substructure mass of ~ 1.7 x 10°Mg inside a projected 
radius of 600 pc around the density peak. 

As an independent test, we repeated the analysis of the 2.2-11m data 
set, which had the highest-significance positive density correction, 
with different models of the point spread function, different data 
reduction techniques, different rotations of the lensed images, different 
models for the lens galaxy surface brightness subtraction and different 
resolutions for the reconstructed source. We also analysed an inde- 
pendent data set taken at 1.6 um with the Near Infrared Camera and 
Multi-Object Spectrograph on board NASA’s Hubble Space Telescope. 
In total, we tested fourteen different models and three different data 
sets that all independently led to the detection of a positive density 
correction at the same spatial position, although with varying levels of 
significance (Supplementary Information). Differential extinction 
across the gravitational arc could also produce a surface brightness 
anomaly. However, the colour of the arc was found to be consistent 
around and at the location of substructure, ruling out the possibly that 
dust affected our results. 

We used an analytic model to determine the mass and the statistical 
significance of the substructure in the context of a physical model”"®. 
In this analytic approach, a truncated pseudo-Jaffe model was used to 
parameterize the substructure mass and position, giving three extra 
free parameters. To obtain a good fit to the observed surface brightness 
distribution of the lensed background source at 2.2 im, we found that a 
substructure was required at a position consistent with the positive 
density correction detected above. Assuming the substructure to be 
situated in the plane of the lens galaxy, we objectively compared the 
smooth and the substructure parametric models in terms of the 
Bayesian evidence, which is the probability of the data given the model 
(marginalized over all parameters). Computing the marginalized 
evidence involved integrating over the multidimensional parameter 
space within predefined priors. In particular, we assumed that the 
substructure was equally likely to be located at any point in the lens 
plane and to have a mass between 4.0 x 10°Mo and 4.0 x 10°Mo, 
the mass range in which comparison with simulations is possible’**. 
See Supplementary Information for further details on the Bayesian 
evidence. We found that our nominal model, with a substructure of 
mass Map =(1.9 £ 0.1) x 108Mo located at (0.036 + 0.005 arcsec, 
0.576 + 0.007 arcsec) relative to the lensing galaxy (in sky coordinates 
defined positive towards the west and the north, respectively), was 
preferred by a factor of e° over a smooth model. This would 
heuristically correspond to a 120 detection of the substructure, if the 
posterior probability distribution function were Gaussian. This agrees 
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Figure 1 | Detection of a dark-matter-dominated satellite galaxy in the 

gravitational lens system B1938+666 at z = 0.881. The data shown here were 
measured at 2.2 jum using the W. M. Keck telescope in June 2010. Additional 
data sets at 1.6 1m, from the Keck telescope and the Hubble Space Telescope, 
are in Supplementary Information. a, Original data set with the lensing galaxy 
subtracted; b, final reconstruction; c, image residuals; d, source reconstruction; 
e, dimensionless potential correction from a smooth potential required by the 
model to fit the data; f, resulting dimensionless projected density corrections. 
Colour bars show counts for a-d. The total lensing potential is defined as the 
sum ofan analytic potential for the host galaxy plus the local pixelated potential 
corrections defined on a Cartesian grid. The potential corrections are a general 


well with the substructure mass found by the pixelated potential 
correction method described above, given the systematic and statistical 
uncertainties (Supplementary information). 

We also considered a model containing two substructures. The 
corresponding Bayesian evidence was significantly lower than the 
Bayesian evidence of the smooth model and that of the single- 
substructure model. Above, we have quoted the statistical uncertainties 
for the substructure mass and position, but the uncertainties will also 
be related to several sources of systematic error, which are discussed in 
Supplementary Information. In general, the uncertainty in the sub- 
structure mass (under the assumption that it is located in the lens 
plane) is entirely dominated by the inference in the tidal truncation 
radius, which requires the substructure to be de-projected, yielding a 
systematic uncertainty of 0.45 dex. 

On the basis of the detection of the substructure, we calculated the 
joint posterior probability function for the projected mass fraction of the 
halo that is made up of substructure’’, f, and the slope, «, of the sub- 
structure mass function (where dN «x M “dM, where N is the number 
density of subhaloes per comoving volume and M is the substructure 
mass; Supplementary Information). For the JVAS B1938+666 lensing 
galaxy (Mien; = 2.46 x 10°Mo within an Einstein radius of 3.39 kpc), 
we found that f=3.973$% at the 68% confidence level, using a 
uniform prior probability ‘distribution for x, for a substructure mass 
range of 4.0 x 10°Mo to4.0 x 10°Mq. Fora Gaussian prior probability 
distribution for « centred at 1.9 + 0.1, which is the slope of the mass 
function predicted from simulations’*"*, f = 15th 7 9% at the 68% con- 
fidence level. The predicted fraction of sibsteucture from simulations 
within the same mass range and projected distance from the host halo 
centre is ~0.11)}% (ref. 19), which is marginally smaller than the 
lower limit imaplied by our detection of a substructure at the 68% 
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correction to the analytical smooth potential and correct for the presence of 
substructure, for large-scale moments in the density profile of the galaxy and for 
shear. When the Laplace operator is applied to the potential corrections and 
translated into surface density corrections, the terms related to the shear and 
mass sheets become zero and a constant, respectively. A strong positive density 
correction is found on the top part of the lensed arc. Note that these images are 
set on an arbitrary regular grid that has the origin shifted relative to the centre of 
the smooth lens model by Ax = 0.024 arcsec and Ay = 0.089 arcsec. When this 
shift is taken into account, the position of the density correction is consistent 
with the position of the substructure found in the analytic reconstruction 
(Supplementary Information). 


confidence level, independent of the prior set on the slope of the mass 
function. However, these simulations model the formation of a Milky 
Way halo at z= 0, and simulations of elliptical galaxies out to z~ 1 
must be made before a more quantitative conclusion can be drawn. 

Whereas flux ratio anomalies of multiply imaged quasars have previ- 
ously been used to measure statistically the level of substructure in 
cosmologically distant lens galaxies” ~’, these analyses have degenera- 
cies when the substructures are dark and it is difficult to localize and 
measure the masses of individual substructures. Hence, the shape of the 
substructure mass function cannot be constrained and so far it has 
remained unclear whether the results from quasar flux ratio anomalies 
are in agreement or disagreement with numerical simulations'’’. Our 
new low-mass detection also allows us to constrain the slope of the 
substructure mass function for galaxies other than our own, when com- 
bined with the detection of the ~18-fold more massive (3.5 x 10°Mo) 
substructure in the previously reported elliptical lens galaxy SDSS 
J0946+1006 (Miens =2.45 x 10!°M, q Within an Einstein radius of 
4.60 kpc) at redshift z= 0.222 (ref. 8). Combining both detections 
resulted in a slope of «= He 17} for the mass function and an average 
mass fraction of (f) =3.34}: 5604 for elliptical galaxies at the 68% con- 
fidence level. These ais sical that the slope of the mass function 
for elliptical galaxies is similar to that observed for Milky Way satellites, 
but that elliptical galaxies have higher mass fractions. 

So far, detailed studies of the mass and luminosity properties of 
substructures have been limited to the Milky Way'***”* and to some 
extent also to M31 (ref. 27), which lies at a distance of ~ 1 Mpc from the 
Milky Way. About 30 luminous satellite galaxies detected within the 
Milky Way virial radius (~250 kpc) are considered possible cases of 
cold dark matter substructure, albeit at a much lower abundance than is 
predicted from simulations’. Twenty-three of the Milky Way satellites 
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have been found to have masses of ~ 107 Mg within a 300-pc projected 
radius, from observations of the dynamics of their stars”*. This method 
of determining masses is limited to the Local Group owing to the 
faintness of the satellite galaxies, and can have a systematic error if 
the satellite is not in dynamic equilibrium or if there is foreground 
contamination. The three-dimensional mass of the JVAS B1938+ 666 
substructure within a radius of 300 pc is M3o9 =(7.2 + 0.5) x 10’Mo 
(M39 = 3.4 x 10’Mo fora singular isothermal sphere model), which is 
comparable to the masses of the Milky Way satellites”*, but is hosted by 
an elliptical galaxy with a velocity dispersion of oss = 187kms ' ata 
co-moving distance of ~3 Gpc, corresponding approximately to a time 
when the Universe was half the age it is today. A 30 upper limit of 
Ly <5.4x 10’ Ly,o was found for the luminosity of the substructure in 
the rest-frame V band within the tidal radius, r, = 440 + 5 pe (Ly.o; 
V-band solar luminosity). This is about a factor of four brighter than 
both the Sagittarius and the Fornax dwarf galaxies'’. The velocity 
dispersion of the satellite, based on the Einstein radius found from 
the best-fitting model, iso, ~ 16 kms ~ | which corresponds to a circular 
velocity of Veire = 27 kms Its three-dimensional mass within 600 pe 
is Meoo = (1.13 £ 0.06) x 108M. The observed properties (mass and 
circular velocity) are comparable to those of the Sagittarius dwarf 
galaxy, which is a satellite of the Milky Way”. 

The mass-to-light ratios of low-mass satellites of the Milky Way 
have also been found to disagree with the expectations from simula- 
tions. It has been predicted that satellites with the luminosities of 
Fornax and Sagittarius should have a velocity dispersion that is a factor 
of two greater than has been observed (~16 kms__'), which represents 
another problem for the cold dark matter paradigm’. In the case of the 
JVAS B1938+ 666 substructure, the 3c upper limit on the luminosity is 
in the range of the Fornax and Sagittarius satellites, and the resulting 
mass-to-light ratio has a lower limit of 3.5Mo/Ly,o. Although this 
result is consistent with the mass-to-light ratios predicted from simu- 
lations, the limit on the luminosity of the substructure will need to be 
decreased by an order of magnitude before any meaningful com- 
parison can be made. This will only be possible with the next genera- 
tion of large optical telescopes, which is planned to come into 
operation by the end of the decade. 
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silicon-based double quantum dot 
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Silicon is more than the dominant material in the conventional 
microelectronics industry: it also has potential as a host material 
for emerging quantum information technologies. Standard fabrica- 
tion techniques already allow the isolation of single electron spins in 
silicon transistor-like devices. Although this is also possible in other 
materials, silicon-based systems have the advantage of interacting 
more weakly with nuclear spins. Reducing such interactions is 
important for the control of spin quantum bits because nuclear 
fluctuations limit quantum phase coherence, as seen in recent 
experiments in GaAs-based quantum dots’. Advances in reducing 
nuclear decoherence effects by means of complex control*> still 
result in coherence times much shorter than those seen in experi- 
ments on large ensembles of impurity-bound electrons in bulk 
silicon crystals’. Here we report coherent control of electron spins 
in two coupled quantum dots in an undoped Si/SiGe hetero- 
structure and show that this system has a nuclei-induced dephasing 
time of 360 nanoseconds, which is an increase by nearly two orders 
of magnitude over similar measurements in GaAs-based quantum 
dots. The degree of phase coherence observed, combined with 
fast, gated electrical initialization, read-out and control, should 
motivate future development of silicon-based quantum informa- 
tion processors. 

Coherent control of single, isolated quantum bits (qubits) has now 
been demonstrated in a large variety of physical systems®, but not in 
silicon. Silicon qubits have garnered interest for over a decade in part 
owing to the potential to exploit techniques and infrastructure from the 
established microelectronics industry””®. Interest in silicon is further 
supported by the expectation of the material’s superior spin coherence 
times relative to other semiconductors, a result of both the reduced 
hyperfine coupling to nuclear spins in the semiconducting host material 
and the reduced number of nuclear spins. Despite the significant interest 
in silicon-based electrically gated qubits, progress has been hampered by 
several fabrication and design challenges. For example, the larger in- 
plane effective electron mass in silicon (which is nearly three times larger 
than in GaAs) shrinks the electron wavefunctions where the quantum 
dots are formed. Smaller wavefunctions necessitate the fabrication of 
correspondingly smaller devices to facilitate the isolation of a single 
electron in each dot. Another concern is the valley degeneracy present 
in bulk silicon, which is removed by an appropriate combination of 
strain, spatial confinement and interface properties. Considerable 
progress has been made in overcoming these and other obstacles, and 
has led to recent measurements demonstrating sufficiently large valley 
splitting and sufficiently long spin relaxation times for spin-qubit 
initialization and read-out operations''”. 

In the present work, we form quantum dots by depleting charge 
from a two-dimensional electron gas within a Si/SiGe undoped hetero- 
structure with lithographically patterned electrostatic gates (Fig. 1). 
We adjust the carrier concentration within the silicon quantum well 
using an isolated global gate above the device, and we control the 
charge states of the two quantum dots using the other gate electrodes. 
After initially depleting the double dot of all but two electrons (with 


one in each dot), we optimize the coupling of the quantum dots and 
electron-bath tunnel barriers to allow coherent dot manipulations. 
This and similar devices and the design advances that have facilitated 
coherent operation are further described in ref. 11; the work presented 
here used device C of that report. 

As previously described’ for a GaAs double quantum dot, the coherent 
electron manipulations we discuss involve operating the silicon double 
dot in the (1,1) charge configuration, where two electrons are 
separated with one in each dot. In this configuration and at a finite 
magnetic field, which splits off the m= +1 spin triplet states, (1,1)T=, 
the spin singlet state, (1,1)S, and the m = 0 spin triplet state, (1,1)To, 
may be treated as the two states of a qubit (Supplementary Informa- 
tion). To measure these states, we rely on spin-to-charge conversion 
based on Pauli spin blockade. Figure 2a shows a spin blockade sig- 
nature at the (0,2)—(1,1) anticrossing obtained by cyclically pulsing the 
gates to transfer the double dot through the (0,1) — (1,1) — (0,2) 
sequence. During the (0,1)— (1,1) transition, an electron is loaded 
from the bath into the left-hand dot. Both singlet and triplet states, 
including states formed from different valleys, are populated. Whereas 
the lowest-energy singlet state is able to freely transfer into (0,2), other 
states, including the lowest-energy triplet, are blocked by the Pauli 
exclusion principle’**°*. This blockade is indicated by a change in 
the conductance of a nearby quantum point contact (QPC), which 
discriminates between the (1,1) and (0,2) dot charge states. The mea- 
sured (0,2) signal level is used to define 100% singlet probability in 
subsequent experiments. The conductance change between (1,1) and 
(0,2), together with the known fractional time spent in the measure 
phase and an experimentally estimated amount of blockade relaxation 
during measurement, allows us directly to convert measured signal 
to singlet probability for all data presented here (Supplementary 
Information). 


Figure 1 | Device design. a, Device cross-section showing undoped 
heterostructure, dielectric and gate stack. b, Scanning electron micrograph of 
actual device before dielectric isolation and field gate deposition. Electrostatic 
gates are labelled L (left), T, R (right), PL, M, PR and Q; gates L and R are used 
for fast pulsing. The straight arrows show current paths for transport 
experiments (Supplementary Information), whereas the curved arrow shows 
the path of current through the QPC. A numerical simulation of the electron 
density for the two-electron (1,1) state is superimposed on the micrograph. 
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Figure 2 | Spin blockade in the double quantum dot. a, Subregion of the 
charge stability diagram, showing the differential transconductance (black and 
white lines) of the charge-sensing QPC as a function of gate voltages. The 
trapezoidal spin blockade signature is seen with cyclic pulsing through the 
sequence (0,1) — (1,1) > (0,2) (orange arrows). Being a differential 
measurement, the technique is sensitive only to changes in the average double- 
dot charge configuration, which is why only the edges of the trapezoidal 
blockade region are visible. The different colours indicate different responses of 
the QPC to electron movements on and between the dots. In particular, the 
current through the QPC decreases when electrons are added to either dot 
(white lines) but it increases when an electron transfers from the right dot to the 
left dot (black lines) (Supplementary Information). The voltage on gate Q (Va) 
was swept with that on gate R (Vp) to compensate for the capacitive coupling 
between gate R and the QPC and hence maintain constant sensitivity 
throughout the scan. b, Energy diagram of the (0,2)-(1,1) anticrossing, showing 
energies of the qubit states (1,1)S and (1,1)To and the Zeeman split (1,1)T+ 
states as functions of detuning, «. The exchange-energy splitting between qubit 
states, J(¢), the Zeeman splitting between triplet states, Ez, and the tunnel 
coupling, f,, are shown. c, Spin funnel obtained by measuring the degeneracy of 
(1,1)S and (1,1)T_ as a function of ¢ and B. The width of the spin funnel implies 
a tunnel coupling, f., of approximately 3 j1eV. The functional form of J(«) 
extracted from the funnel data is shown in Fig. 5. 


The spin blockade is lifted by the presence of an excited state, observed 
when biasing deeper into the (0,2) charge region. The excited state opens 
an additional unblocked pathway and results in the trapezoidal blockade 
region seen in Fig. 2a. The trapezoid height indicates an excited state at 
an energy of approximately 140 eV. Numerical modelling suggests that 
this state is a (0,2) triplet involving an excited valley state of the right- 
hand dot, but it could alternatively be an orbital triplet if the right-hand 
dot is highly asymmetric'™*. In either case, the blockade region is 
sufficiently wide to distinguish the two qubit states (1,1)S and (1,1)To, 
and all other excited states are high enough in energy to have no further 
role in coherent operation. 

The coherent evolution of the singlet and triplet qubit states is gov- 
erned by two interactions. First, the singlet-triplet energy splitting, 
which is primarily a result of hybridization of (1,1) and (0,2) singlets, 
acts as an effective exchange interaction with energy J between the two 
spins. Second, the dot-to-dot difference in the instantaneous hyperfine 
coupling to *’Si nuclear spins (which comprise 4.7% of the lattice nuclei 
in isotopically natural silicon) acts as a singlet-triplet coupling energy, 
Ayr. This coupling fluctuates from one experimental run to the next 
with a Gaussian distribution of standard deviation oyp. In a Bloch 
sphere picture with the singlet and triplet states at the poles, effective 
exchange causes a rotation about the z axis and the hyperfine energy 
introduces a random x component to this rotation axis. Experimentally, 
we have direct control over only J and are able to vary it by electrically 
pulsing the left- and right-hand gates to adjust the detuning, ¢, between 
the dots (Fig. 2b). Because J and Ay; provide independent rotation axes 


LETTER 


and we only have direct control of J, exploration of the Bloch sphere 
requires us to achieve J values both much larger than and much smaller 
than oyp. For deep pulses into the (1,1) charge configuration, J is 
expected to be proportional to ¢,/|é|, where f, is the interdot tunnel 
coupling”. The tunnel coupling was kept small to access J values below 
the small oy, in silicon. 

Anestimate of the tunnel coupling is provided by measuring J(¢) near 
é = Oina ‘spin funnel’ experiment™. An applied magnetic field, B, splits 
the (1,1)T+ states from (1,1)To by the Zeeman energy, Ez = +gupB, 
where jg is the Bohr magneton and gis the effective electron g factor in 
silicon. The energetic crossing of (1,1)T_ with (1,1)S occurs where 
J(e) = |Ez| (Fig. 2b). At this crossing, hyperfine-induced mixing of 
(1,1)S and (1,1)T— occurs, which is detected by the charge sensor as 
a spin blockade of the (1,1)T_ state. The detuning dependence of the 
energetic crossing point, corresponding to the form of J(¢) as a func- 
tion of B, resembles a funnel whose width can be used to estimate t, 
(Fig. 2c). Using the funnel, we tuned our device until we obtained an 
acceptable t. value of about 3 peV. 

Controlled rotations between superpositions of the singlet and triplet 
states may be achieved by modifying J. Such an experiment, following 
ref. 1, produces data analogous to Rabi oscillations. We start with the 
(0,2)S ground state and then slowly separate the electrons by biasing the 
dots deep into the (1,1) charge configuration (Fig. 3a, b). As a result of 
adiabatic passage (Supplementary Information and ref. 1), a low-J 
ground state is prepared that is a superposition of the (1,1)S and 
(1,1)To states owing to the hyperfine field but, because J is non-zero, 
is not a hyperfine eigenstate. This superposition is rotated on the Bloch 
sphere by the non-adiabatically applied, larger-J exchange pulse. The 
rotation axis depends on both J and App (Fig. 3d). After the finite- 
duration exchange pulse, the gates are then pulsed back deep into 
(1,1) and adiabatically ramped into (0,2) for read-out. During this 
process, the low-J ground state is mapped back to the Pauli-unblocked 
hybridized singlet and the orthogonal low-/ state is mapped to the 
Pauli-blocked triplet. 

Figure 3c shows the resulting oscillation in the singlet probability, P,, 
as a function of the exchange-pulse duration and detuning. At less 
negative detunings, corresponding to high values of J, the initial average 
visibility of the Rabi oscillations is 0.7 + 0.1. At more negative values of 
é, corresponding to lower values of J, the initial visibility is reduced and 
hyperfine-induced damping is observed. At intermediate detunings, as 
many as 20 oscillations can be resolved. The average visibility, the 
damping at low J values and the phase modulation of the oscillations 
closely follow the behaviour expected when averaging over a Gaussian 
distribution of hyperfine differences Ajp, as further described in 
Supplementary Information. Curve-fitting the data to this model allows 
us to extract J() over a wide bias range. 

The practicality of implementing quantum computation using 
silicon-based double quantum dots depends on how long a separated 
(1,1)S state takes to dephase into an incoherent mixture of (1,1)S and 
(1,1)To over multiple experimental runs. This timescale, T>*, directly 
impacts quantum control requirements, because it constrains either 
the speed or the complexity of high-fidelity gate operations. In III-V 
semiconductor quantum dots, T>* is reported to be a few to tens of 
nanoseconds'***°. The measurement of T,* is accomplished by pre- 
paring a state orthogonal to the hyperfine rotation axis and then allow- 
ing it to evolve freely in many trials. In each trial, the hyperfine rotation 
frequency is different, such that on averaging over those trials a 
dephasing-induced decay is observed. In our double-dot system, we 
prepare the initial singlet state by biasing the device into the (0,2)S 
state and quickly (within ~20 ns) pulsing the gates to a large negative 
detuning deep in the (1,1) charge configuration (Fig. 4a, b). The 
pulsing is non-adiabatic with respect to the nuclear mixing time, so 
the separated electrons are initially in the (1,1)S state. The subsequent 
state precession is a function of both the instantaneous hyperfine 
gradient and the remaining exchange energy. As a result, an ensemble 
average both oscillates and dephases into a mixture of (1,1)S and 
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Figure 3 | Rabi oscillation pulse sequence and data. a, b, Essential elements 
of the pulse sequence used to obtain Rabi oscillations: V;, and Vp superimposed 
on the charge stability diagram (a); and represented as functions of time 
(b). The (0,2)S ground state is first prepared at point F with an exchange of an 
electron with the bath of the right-hand dot, if necessary. We then pulse to an 
intermediate point, P, before the subsequent adiabatic transition into the low-J 
ground state at point S, deep into (1,1). An exchange pulse and state rotation is 
then applied at E and followed by a return to S. The adiabatic transfer is then 


(1,1)To (refs 23, 26). The gates are then pulsed quickly back into the 
(0,2) charge configuration for measurement. 

Figure 4c shows the measured ensemble-averaged singlet probability 
as a function of separation time for different detunings, corresponding 
to different values of residual exchange energy, J, present during the 
separation pulse. For moderate negative detunings (J > oyp), the singlet 
state rotates primarily around the z axis and the singlet probability 
remains close to 100% (Fig. 4d). For large negative detunings 
(J< opp), the singlet state rotates around the x axis and dephases, 
bringing the singlet probability to approximately 50%. Intermediate 
detunings result in oscillation frequencies and asymptotic probabilities 
between these two extremes. Fitting the data in Fig. 4c with a consistent 
model”*”* for all traces provides estimates for the residual values of Jand 
Our (Supplementary Information). The extracted functional form of 
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Figure 4| T,* pulse sequence and data. a, b, Essential elements of T>* pulse 
sequence. Analogous with the Rabi oscillation pulse sequence (Fig. 3a, b), we 
initialize in the (0,2)S ground state at F and then pulse to intermediate point P. 
We then non-adiabatically pulse deeply into (1,1) to point S (this is the 
separation pulse) for a variable amount of time for singlet dephasing, and then 
return non-adiabatically to point M for measurement. c, Evolution of (1,1)S as 
indicated by P,, as a function of separation time for several different separation- 
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reversed and the dots are biased to point M for measurement, which generally 
consists of >90% of the pulse cycle. c, Rabi oscillations of the singlet probability, 
P,, as a function of ¢ and exchange-pulse duration. d, Bloch sphere 
representations of state evolution for J > yp (top) and J < oy (bottom). 
Several instances of initial hyperfine ground states are shown as black dots (all 
with Ay¢ > 0, for clarity), with corresponding rotation axes and trajectories. 
The data in c correspond to the average probability of returning to the initial 
prepared low-J ground state. 


J(é) is shown in Fig. 5 and is in excellent agreement with the values 
extracted from the Rabi oscillation periods where the two data sets 
overlap. The model yields an estimate for oyp of 2.6+0.2neV. A 
theoretical calculation of the expected oy;p value in isotopically natural 
silicon, using real-space, hybrid Poisson-Schrédinger/full-configuration- 
interaction simulation methods, and using our best estimates of the 
microscopic device geometry and tuning configuration, yields a value 
for Oy of 1.9neV (Supplementary Information and ref. 27), in 
reasonable agreement with the experimentally measured value. 
This value corresponds to a nuclei-induced dephasing time of 
T)* = \2h/oup = 360 + 30 ns. 

The reduced hyperfine coupling in silicon relative to other material 
systems will allow high-fidelity quantum control of future silicon- 
based qubits. Although the present device has only limited coherent 
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pulse detunings (residual J). Coloured circles show measured data; the least- 
squares fit (black lines) and 1-s.d. confidence intervals (coloured bands) for a 
hyperfine-averaged model”*”° are superimposed. The values of J/oyp resulting 
from this fit are shown in the key. d, Bloch sphere representations of an 
ensemble of trajectories, with J/o};; values as indicated by colour in c. The 
ensemble of rotation axes is also shown in each case. The data in c correspond to 
the average projection of trajectories onto the z axis. 
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Figure 5 | Effective exchange energy versus detuning. J(2) extracted from the 
spin funnel data (Fig. 2c), Rabi oscillation data (Fig. 3c) and T>* data (Fig. 4c). J 
ranges from 0.6 to 700 neV with excellent agreement where these data sets 
overlap (Supplementary Information). Error bars on the T>* data (blue) and 
the confidence interval for the Rabi data (green) are 1-s.d. fit-parameter 
uncertainties; spin funnel symbol sizes correspond to Gaussian-fit transition 
widths. The grey band denotes the estimated oy, value from the T>* data. 


control capabilities owing to its reliance on the small and fluctuating 
hyperfine field for one axis of control, future devices will almost 
certainly not rely on hyperfine interactions. Other control methods, 
such as exchange-only control of three-dot systems”, will allow com- 
plete coherent manipulation on a timescale much shorter than T,*. 
There are no fundamental problems with delivering exchange pulses 
on subnanosecond timescales’, which will further allow pulse com- 
pensation and dynamical decoupling techniques for high-fidelity gate 
operations”’*°. These techniques can be used in future efforts to study 
and eliminate other decoherence sources not yet isolated in the experi- 
ments described here. Our results therefore indicate strong potential 
for the execution of the quantum error correction protocols that allow 
sustained quantum computation in silicon-based systems. 


METHODS SUMMARY 


The device is characterized in a dilution refrigerator with an estimated effective 
electron temperature of 150 mK. The QPC conductance changes when electrons 
are loaded, unloaded or transferred between the dots and is used to determine the 
charge state of the quantum dots. Fast voltage pulses are applied to the left- and 
right-hand gates using a Tektronix 5014 arbitrary wave generator with low-pass 
filtering at its output. Measurements are conducted with an external in-plane 
magnetic field of 30 mT, except for the spin funnel experiment, in which the field 
is swept. 

The differential transconductance measurement technique used is implemented 
by superimposing a low-frequency voltage square wave (generally 600-900 Hz) on 
the left-hand gate and using standard lock-in amplification techniques with a 
30-ms time constant and 60-ms delay between measurements. The low-frequency 
square wave is achieved by concatenating an appropriate number of the higher- 
frequency coherent manipulation pulse sequences to bring the lock-in measure- 
ment frequency down to near the noise floor of our measurement system. The 
added square wave is present only during the measurement phase of the pulse 
sequences. Apart from a background signal resulting from coupling between the 
low-frequency square wave and the QPC, the differential transconductance 
measurement technique is sensitive only to changes in the QPC conductance 
resulting from changes in the double-dot charge state configuration. As a result, 
we see only lines marking the outlines of the charge stability regions and of the 
trapezoidal triplet blockade region in (0,2), instead of the more commonly seen 
QPC conductance plateaux. 

More details on experimental methods are available in Supplementary 
Information. 
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From determining the optical properties of simple molecular crystals 
to establishing the preferred handedness in highly complex verte- 
brates, molecular chirality profoundly influences the structural, 
mechanical and optical properties of both synthetic and biological 
matter on macroscopic length scales’”. In soft materials such as 
amphiphilic lipids and liquid crystals, the competition between local 
chiral interactions and global constraints imposed by the geometry of 
the self-assembled structures leads to frustration and the assembly of 
unique materials**. An example of particular interest is smectic 
liquid crystals, where the two-dimensional layered geometry cannot 
support twist and chirality is consequently expelled to the edges in a 
manner analogous to the expulsion of a magnetic field from super- 
conductors’ '°. Here we demonstrate a consequence of this geometric 
frustration that leads to a new design principle for the assembly of 
chiral molecules. Using a model system of colloidal membranes", we 
show that molecular chirality can control the interfacial tension, an 
important property of multi-component mixtures. This suggests an 
analogy between chiral twist, which is expelled to the edges of two- 
dimensional membranes, and amphiphilic surfactants, which are 
expelled to oil-water interfaces’”. As with surfactants, chiral control 
of interfacial tension drives the formation of many polymorphic 
assemblages such as twisted ribbons with linear and circular topolo- 
gies, starfish membranes, and double and triple helices. Tuning 
molecular chirality in situ allows dynamical control of line tension, 
which powers polymorphic transitions between various chiral struc- 
tures. These findings outline a general strategy for the assembly of 
reconfigurable chiral materials that can easily be moved, stretched, 
attached to one another and transformed between multiple confor- 
mational states, thus allowing precise assembly and nanosculpting of 
highly dynamical and designable materials with complex topologies. 

In experiments on chiral self-assembly, we used a two-component 
mixture consisting of 880-nm-long rod-like fd viruses and the non- 
adsorbing polymer Dextran. In aqueous suspension, fd viruses alone 
have purely repulsive interactions'’. Adding non-adsorbing polymer 
to a dilute isotropic suspension of fd rods induces attractive inter- 
actions by the depletion mechanism and leads to their condensation 
into colloidal membranes, which are equilibrium structures consisting 
of one-rod-length-thick, liquid-like monolayers of aligned rods” 
(Fig. la). Despite having different structures on molecular length 
scales, the coarse-grained properties of colloidal membranes are identical 
to those of conventional lipid bilayers. However, unlike their amphiphilic 
counterparts, colloidal membranes do not form vesicles and are instead 
observed as freely suspended disks with exposed edges. Here we invest- 
igate the behaviour of these exposed edges in a manner analogous to 
previously studied liquid-liquid domains embedded in lipid bilayers’**. 
For our experiments, it is essential that fd viruses are chiral, that is, a pair 
of aligned viruses minimizes their interaction energy when they are 
slightly twisted with respect to each other in a preferred direction. The 
strength of chiral interactions can be continuously tuned to zero through 
either genetic or physical methods'*” (Supplementary Fig. 1). 


Before investigating chiral membranes, we determined the edge 
structure of a membrane composed of simpler, achiral rods using three 
complementary imaging techniques, namely two-dimensional (2D) and 
three-dimensional (3D) polarization microscopy and electron micro- 
scopy. The local tilting of the rods within a membrane was determined 
using a 2D LC-PolScope, which produces images in which the intensity 
of each pixel represents the local retardance of the membrane’ (Fig. 1d). 
Such images can be quantitatively related to the tilting of the rods away 
from the layer normal” (the z axis). Rods in the bulk of a membrane are 
aligned along the z axis, and it follows that 2D LC-PolScope images 
appear black in that region (Fig. le). By contrast, the bright, birefringent 
ring along the membrane’s periphery reveals local tilting of the rods 
(Fig. le and Supplementary Fig. 2). For achiral rods, this indicates that a 
membrane has a hemi-toroidal curved edge (Fig. 1b, c). By comparison 
with an untilted edge, a curved edge structure lowers the area of the rod- 
polymer interface, thus reducing interfacial tension, at the cost of 
increasing the elastic energy due to a twist distortion. This hypothesis 
is confirmed by visualizing the 3D membrane structure using electron 
tomography, which shows that the viruses’ long-axis transitions from 
being parallel to the z axis in the membrane bulk to being perpendicular 
to the z axis and tangential to the edge along the membrane periphery 
(Fig. 1d and Supplementary Fig. 3). For achiral viruses, the spontaneous 
twist at the edges is equally likely to be clockwise or anticlockwise. 
However, the 2D LC-PolScope records only 2D projections of the 
birefringence map and thus cannot distinguish between these two pos- 
sibilities. For this reason, we used a 3D LC-PolScope”, which reveals 
that achiral rods are equally likely to tilt in either direction, thus con- 
firming the spontaneously broken chiral symmetry at the edge of an 
achiral membrane (Fig. 1f-h and Supplementary Fig. 4). Simulations of 
hard, achiral spherocylinders and depletant molecules provide addi- 
tional verification that interfacial effects alone cause spontaneous twist- 
ing of achiral rods at the membrane’s edge (Supplementary Fig. 5). 

When viewed with optical microscopy, a membrane’s edge shows 
pronounced thermal fluctuations, the analysis of which yields the line 
tension, Yer, which is the energetic cost required to move rods from the 
membrane interior to the periphery'*”'”’ (Fig. 2a). A typical fluc- 
tuation spectrum for an achiral edge is shown in Fig. 2b. In the regime 
of small wavevector, q, the mean square Fourier amplitudes, (a,°) are 
independent of q, allowing us to extract the effective line tension, 
Vert = kgT aq) (ref. 12), where kg is the Boltzmann constant and T 
is the temperature. In the large-q limit, fluctuations scale as 1/q° and 
are therefore substantially suppressed relative to those of a simple 
interface. One possible explanation is that fluctuations are suppressed 
as a result of a quasi-one-dimensional (quasi-1D) nematic phase at the 
edge, which is a direct consequence of rod tilting (Fig. le). At large 
wavelengths, the elastic energy required to deform the quasi-1D 
nematic phase is negligible and the fluctuations are dominated by 
the line tension. With decreasing wavelength, there is an increasing 
energetic penalty associated with bending the quasi- 1D nematic phase, 
resulting in the suppression of fluctuations. 
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Next we demonstrated that the chirality of the constituent rods 
controls the magnitude of yer. The global constraints imposed by 
the 2D membrane topology are fundamentally incompatible with local 
twisting (chiral interactions) between constituent rods, leading to the 
expulsion of chirality from the bulk of the membrane’ (untwisting of 
rods). It follows that introducing chirality raises the energy of rods in 
the membrane bulk while simultaneously lowering the energy of rods 
close to the edge, where they are free to twist and satisfy chiral inter- 
actions. Therefore, we propose that there are two contributions to the 
effective line tension, pare aNd Ychirap SUCH that yer = Yoare — Ychirab 
where pare is the line tension of a membrane edge composed of achiral 
rods and y¢hirai is the chiral contribution to the line tension'®”*. To 
Measure )pare ANd chiral independently, we used the temperature depend- 
ence of fd chirality (Supplementary Fig. 1). We measured ypare at high 
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Figure 1 | Edge structure of membranes 
assembled from an achiral mixture of wild-type 
and Tyr21Met fd phages. a, Differential 
interference contrast (DIC) image of a colloidal 
membrane. b, Illustration of colloidal membrane 
indicating that its edge adopts a surface-tension- 
minimizing curved profile, forcing rods to twist 
locally. For clarity, the aspect ratio of rods has been 
reduced. c, Cross-section shown in b. d, Electron 
micrograph of a membrane directly visualizing the 
curved edge profile (Supplementary Fig. 3). e, 2D 
LC-PolScope birefringence map of two 
membranes. The bright band associated with the 
edges indicates local rod tilting (Supplementary 
Fig. 2). f, 3D LC-PolScope image composed of an 
array of conoscopic microimages, each sampling 
the local orientation of the rods. In each 
microimage, the displacement of the dark spot 
away from the centre determines the local tilting of 
the rods with respect to the layer normal (the z axis; 
Supplementary Fig. 4). g, h, Sections of 3D LC- 
PolScope images of two edges with clockwise 

(g) and anticlockwise (h) twists, respectively, 
illustrating the broken chiral symmetry found at 
the edge of achiral membranes. Scale bars, 5 um 
(a, e-g, h); 0.2 um (d). 


temperature (60 °C), where rods are achiral (7hirai = 0) and, therefore, 
Yett = Ybare (Fig. 2c). Decreasing the temperature increases the strength of 
chiral interactions, resulting in a smaller ).¢-value and demonstrating that 
chirality indeed reduces line tension (Fig. 2c). Temperature-dependent 
chiral contributions (yhiral(T) = —Yer(T) + Yoare) Were converted into 
a function that depends on the chiral wavevector, qo, using measure- 
ments of the temperature dependence of the cholesteric twist 
(Supplementary Fig. 1). Notably, measurements of ).yira) at different 
depletant concentrations collapse onto a universal curve (Fig. 2d), con- 
firming that the two contributions to y.¢ are uncorrelated. Our data 
show that chirality can reduce the line tension by as much as 450kgT 
per micrometre of edge length (Fig. 2d). To test this hypothesis inde- 
pendently, we measured ye¢ for membranes composed of the 
Tyr21Met mutant fd virus. By contrast with wild-type fd virus, the 
chirality of this mutant fd virus is temperature independent (Sup- 
plementary Fig. 6). We find that ). for the Tyr21Met mutant mem- 
branes is temperature independent over the entire measurement range, 
unambiguously demonstrating chiral control of line tension and ruling 
out any intrinsic temperature effects. 

Our findings raise the possibility that at sufficiently low temperatures 
the chiral contribution to interfacial energy could dominate the bare 
line tension, lowering the energetic cost of creating edges and leading to 


Figure 2 | Chiral control of effective line tension, yegg. a, DIC images, taken 
1s apart, illustrating the fluctuations of the membrane’s edge. Scale bar, 5 um. 
b, Fluctuation spectrum ((a, 2) versus q) plotted for three different 
temperatures. For small q, (a, ) is independent of q and inversely proportional 
to the effective line tension, Yer = = kpTKa,” ). For large q, the suppressed 
fluctuations are independent of depletant concentration (osmotic pressure) 
and sample temperature (fd chirality) and scale as 1/q’. Red lines are fits using 
equation (3) (Methods). c, yer extracted from the low-q regime, plotted for a 
range of sample temperatures (fd chirality) and Dextran concentrations 
(osmotic pressures). In the achiral limit at 60 °C, ychiral = 0 and pare = Veet: 
Increasing the Dextran concentration increases )/pare. Decreasing the 
temperature reduces Ye because chiral increases. The red lines of fixed slope are 
guides to the eyes illustrating the universal scaling of jeg with chirality. d, )chirat 
as a function of the twist wavevector, qo (measured in the cholesteric phase), for 
various depletant concentrations. The temperature dependence of qo is 
extracted from Supplementary Fig. 1. The red curve is quadratic fit to the data: 
Vchiral = Cqo’- Error bars, s.d.; 1 = 10. 
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Figure 3 | Transition of a 2D disk into 1D twisted ribbons. a, A temperature 
quench reduces the line tension, inducing a transition of the 2D membrane into 
1D twisted ribbons (Supplementary Movie 1). b, The stability diagram indicates 
regions of phase space where ribbons and membranes are observed as a function 
of temperature (chirality) and Dextran concentration (Cpextran). The boundary 
indicates the temperature above which no stable or metastable ribbons are 

observed at a given Dextran concentration. c, An overlaid phase contrast (red)/ 
fluorescence (green) image ofa stable ribbon containing a low volume fraction of 


their spontaneous formation. In this respect, we expect chirality to have 
a role similar to that of amphiphilic surfactants in oil-water mixtures. 
Such surfactants lower surface tension and lead to the formation of 
excess interfaces through the assembly of microemulsions. To invest- 
igate this possibility, we quenched a membrane assembled at low deple- 
tant (Dextran) concentration from the high-temperature achiral phase. 
With decreasing temperature, fluctuations of the membrane edge 
become more pronounced, indicating that yg is decreasing. 
Eventually the edge becomes unstable, resulting in a remarkable poly- 
morphic transition in which twisted, ribbon-like structures grow along 
the entire periphery of the disk, generating a starfish-shaped membrane 
(Fig. 3a and Supplementary Movie 1). This polymorphic transition is 
reversible; with increasing temperature, the starfish membrane col- 
lapses back into a disk. The stability diagram indicates that 1D twisted 
ribbons are stable at low Dextran concentrations and low temperatures 
(small ),,,¢ Values and large ypira1 Values), whereas flat 2D membranes 
are stable for large ypare values or small y.yira) Values (Fig. 3b). DIC 
microscopy reveals the overall structure of the ribbons, such as their 
pitch and width (Fig. 3a); fluorescence microscopy shows the liquid-like 
dynamics of single rods within the assemblage (Fig. 3c and Sup- 
plementary Movie 2); and 2D LC-PolScope imaging quantitatively 
determines the local tilt of the rods (Fig. 3d). A schematic representation 
of the twisted ribbons based on this information, indicating inhomo- 
geneous tilting of the rods within a ribbon, is shown in Fig. 3e. Such data 
are in agreement with recent theoretical predictions”. 

Chiral self-assembly produces structures that combine seemingly 
divergent properties of high elasticity and fluidity. To demonstrate these 
properties, we trapped two opposite sides of a flat disk with a dual-beam 
optical trap and applied an extensional force, causing the transition from 
a stretched disk into a twisted ribbon (Fig. 3f and Supplementary Movie 
3). This mechanically induced disk-to-ribbon transition is reversible; on 
removal of the optical trap, the highly elastic ribbon relaxes back into its 
original shape. Furthermore, using optical tweezers it is possible to 
change the architecture of the chiral assemblages systematically, allow- 
ing new pathways for assembling nanomaterials with highly complex 
topologies. For example, two ends of a linear twisted ribbon are easily 
joined together to assemble a closed, ring-like polymeric structure. The 
disk-to-ribbon transition is tightly coupled to the assemblage topology 
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fluorescently labelled rods (Supplementary Movie 2). d, LC-PolScope image 
indicating rod tilting that penetrates from the edge of the ribbon towards its 
centre. The intensity at each pixel is proportional to sin*(0), where 0 is the local 
rod tilt angle away from the image plane. e, Schematic structure of twisted 
ribbons as deduced from optical microscopy. f, A 2D colloidal membrane is 
trapped with a dual-beam optical trap and stretched, inducing the transition to a 
1D twisted ribbon (Supplementary Movie 3). Red circles indicate moving traps 
and green circles indicate fixed traps. Scale bars, 2 |1m. 


(Fig. 4d). On increasing the temperature, all linear ribbons transform 
into achiral disks (Supplementary Movie 4), whereas closed, ring-like 
ribbons remain twisted owing to the constraint imposed by the ring 
topology (Supplementary Movie 5). 

The chiral assembly pathways described here are hierarchical, open- 
ing up the possibility that simple changes on microscopic (angstrém) 
length scales can be used to control structures on macroscopic 
(millimetre) length scales*”*. At the smallest relevant length scale, 
each virus is assembled from DNA that is coated with thousands of 


Figure 4 | Hierarchical self-assembly: from isolated viruses and metastable 
disks to singly and doubly twisted ribbons. a, In the ribbon region of the 
phase diagram in Fig. 3b, fd viruses condense into metastable, self-limited disks. 
On longer timescales, these disks coalesce and form 1D twisted ribbons. 

b, Image of a single ribbon that has collapsed into a toroid. c, Doubly twisted 
ribbons are assembled by wrapping around each other two singly twisted 
ribbons with a well-defined phase difference (Supplementary Fig. 8). 

d, Directed assembly of a ring-like supramolecular polymer from a linear 
twisted ribbon using optical tweezers. e, f, Single point mutation of the major 
coat protein permits microscopic control of ribbon chirality. Wild-type fd virus 
assembles into left-handed ribbons (e), whereas the Tyr21Met fd virus forms 
right-handed ribbons (f), as demonstrated by the z-stack series of 2D LC- 
PolScope images. Scale bars, 2 jum. 
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copies of the major coat protein. At the mesoscopic length scale, 
attractive depletion interactions condense dilute, rod-like viruses into 
self-limited, metastable, mesoscopic colloidal disks whose diameter is 
approximately one twist penetration length’*’*’”** (Fig. 4a). Once 
assembled, mesoscopic disks themselves experience mutual attractions, 
resulting in lateral coalescence and formation of twisted doublets. On 
longer timescales, this coalescence process continues, producing 
ribbons with complex topologies and multiple branching points (Sup- 
plementary Fig. 7). The length of these ribbons can reach macroscopic 
(millimetre) dimensions. This hierarchical assembly process describes 
a robust synthesis of ribbons that behave as supramolecular polymers 
and can assemble into even more-complex structures at higher levels of 
hierarchy, such as toroids (Fig. 4b). Under certain conditions, ribbons 
also wrap around each other to produce a great variety of double- and 
triple-helical structures (Fig. 4c and Supplementary Fig. 8). A challenge 
associated with hierarchical assembly pathways is to control the final 
macroscopic assemblage by specific modification of relevant micro- 
scopic parameters. Previous work showed that even a single amino- 
acid mutation of the major coat protein can alter the microscopic 
chirality of individual viruses'*. We used this finding to demonstrate 
that such amino-acid exchange propagates across all hierarchical levels, 
switching the ribbon handedness and allowing for microscopic control 
of macroscopic ribbon chirality (Fig. 4e, f). 

We have shown that the chirality of the constituent molecules can be 
used to tune the line tension associated with the exposed edge of a 
colloidal monolayer membrane. The potential of hierarchical chiral 
self-assembly is demonstrated by assembling reconfigurable materials 
with unique mechanical properties and complex topologies that can 
easily switch between multiple polymorphic forms”. 


METHODS SUMMARY 


Wild-type and Tyr21Met fd viruses were purified according to previously published 
protocols'*. To reduce the number of longer viruses, which destabilize monolayer 
structures such as colloidal membranes or twisted ribbons, virus suspensions were 
cycled through isotropic-nematic phase coexistence. Only the isotropic portion of 
the samples, enriched in shorter viruses, was used for experiments'!. The polydis- 
persity of viruses was quantified using gel electrophoresis (Supplementary Fig. 9). All 
samples were prepared in a buffer containing 20 mM Tris and 100 mM NaCl at pH 
8.0. Addition of the non-adsorbing polymer Dextran (relative molecular mass, 
500,000; Sigma-Aldrich) induces condensation of rod-like viruses into either flat 
colloidal membranes or various other chiral assemblages. The strength of chiral 
interactions between rods was tuned either by changing the sample temperature 
or by varying the ratio of wild-type fd virus to Tyr21Met fd virus in membranes 
containing a mixture of the two’’. Optical microscopy data was taken with an 
inverted microscope (Nikon TE-2000) equipped with DIC optics, a homemade 
heating/cooling stage and a cooled charged-coupled-device camera (QImaging, 
Retiga Exi). The 2D and 3D LC-PolScope imaging set-ups have been described 
elsewhere’*”’. To prevent binding of viruses to surfaces, microscope slides and cover- 
slips were cleaned and subsequently coated with a silane agent (3-(trimethoxysilyl) 
propyl methacrylate, Sigma Aldrich) to which a polyacrylamide brush was polymer- 
ized from methacrylate groups. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Virus stock solutions. Filamentous viruses were grown using standard biological 
techniques that have been discussed elsewhere*’. The resulting virus suspensions 
contain a low fraction of multimeric virus particles (dimers and trimers) with 
longer contour lengths. Because high monodispersity in particle length is essential 
for assembly of membranes and ribbons, the polydispersity was reduced by frac- 
tionating the samples through the isotropic-nematic phase transition, as longer 
viruses preferentially dissolve in the nematic liquid-crystalline phase. Suspensions 
of fd viruses (2-8 ml) were prepared at isotropic-nematic phase coexistence. Only 
the isotropic (top) portion of the sample, enriched in monomeric viruses, was used 
for subsequent experiments. The fractionation was repeated once or twice to 
obtain sufficiently monodisperse suspensions that reproducibly assemble into 
colloidal membranes and ribbons. The polydispersity of virus suspensions was 
quantified using gel electrophoresis of intact viruses. Approximately 20 ul of a 
0.5mg ml virus solution was loaded into a 1.8% agarose gel and run at 
1.4V cm! for approximately 8h (ref. 31). The virus coat protein was dissolved 
by immersing gels in 0.2M NaOH for 30min, and the remaining DNA was 
visualized with ultraviolet light after staining with ethidium bromide’! 
(Supplementary Fig. 9). Only virus suspensions with a monodispersity greater 
than 95% were used for experiments. For florescence microscopy, viruses were 
labelled with florescent dyes as described previously**. The wild-type fd virus was 
labelled with Alexa 488-NHS ester (Invitrogen) and the Tyr21Met fd virus was 
labelled with DyLight 546-NHS ester (Pierce). All samples were buffered in 20 mM 
Tris, pH 8.0, to which 100 mM NaCl was added to screen electrostatic repulsion 
between viruses. The concentration of the viruses was determined using absorp- 
tion spectroscopy”. 

Tuning the twist of the cholesteric phase. The strength of chiral interactions 
between viruses was quantified by measuring the pitch of a cholesteric liquid- 
crystalline sample. Glass cylindrical capillaries, 0.7 mm in diameter (Charles 
Supper Company), were filled with fd virus suspensions. For the cholesteric tex- 
ture to develop properly, it was necessary to clean the capillaries in a hot detergent 
solution (Hellmanex) after soaking them in 5M NaOH. After equilibration for 
several hours, the samples show a well-known fingerprint texture, seen by polar- 
ization microscopy, that is characteristic of cholesteric liquid-crystalline phases'*”” 
(Supplementary Fig. 1). The twist wavevector, qo = 2n/P, was determined by 
measuring the cholesteric pitch, P. Bulk samples were prepared at 100 mg ml '. 
Previous work, as well as experiments described in Supplementary Fig. 4, indicate 
that this is the effective virus concentration in membranes and ribbons'!. We 
assume that qo is the same for bulk cholesteric phase and 2D membranes. 

Two complimentary methods were used to tune the strength of chiral interac- 
tions between fd viruses: genetic mutations of the major coat protein and physical 
control of sample temperature. Most aspects of the wild-type fd virus phase 
behaviour, such as the location of the isotropic-nematic phase transition, are 
independent of temperature, as is expected for entropic suspensions of hard, 
rod-like particles. An exception is that the wild-type fd virus persistence length 
is weakly dependent on temperature’. Data shown in Supplementary Fig. 1 indi- 
cates that decreasing temperature increases the strength of chiral interactions 
between the viruses, as evidenced by the measurements of the cholesteric pitch’”. 
Notably, the twist wavevector shows evidence of a second-order phase transition 
from a high-temperature achiral phase to a low-temperature cholesteric phase. By 
contrast with the temperature-dependent chirality of wild-type fd virus, the chirality 
of Tyr21Met fd virus does not show any temperature dependence (Supplementary 
Fig. 6). 

An alternative control of chirality involves modifying the composition of the 

major coat protein. For example, Tyr21Met fd virus, a mutant that differs from 
wild-type fd virus by a single point mutation of the major coat protein, forms a 
cholesteric helix with the opposite handedness'*. By controlling the ratio of wild- 
type to Tyr21Met fd virus, it is possible to tune the chirality of the resulting sus- 
pension”’. Such mixtures still form colloidal membranes, and fluorescence micro- 
scopy indicates that the two components are uniformly dispersed on optically 
resolvable length scales (~300 nm). We have used both temperature and genetic 
methods to tune the chirality of the constituent rods. 
Light microscopy sample preparation. Light microscopy chambers were 
assembled using glass slides and coverslips (Goldseal, Fisher Scientific) with a 
layer of unstretched Parafilm as a spacer. Glass slides and coverslips were coated 
with an acrylamide brush to prevent non-specific binding of viruses to the glass 
surfaces, by a procedure described elsewhere. We mixed fd virus with the non- 
adsorbing polymer Dextran (relative molecular mass, 500,000; Sigma-Aldrich) to 
from membrane and ribbons samples. The virus concentration remained fixed at 
10mg ml ’, and the Dextran concentration varied between 30 and 50 mg ml as 
indicated in the main text. The samples, sealed using ultraviolet-cured glue 
(Norland Optical) and stored at 4°C, remained good for weeks to months. 


Optical microscopy. Samples were characterized using a number of complimentary 
optical microscopy techniques. Except those made using the 3D LC-PolScope, all 
observations were made using a X 100 oil-immersion objective (Plan Fluor; numerical 
aperture, 1.3) on an inverted microscope (Nikon TE-2000). Images were recorded 
with cooled charge-coupled-device (CCD) cameras (CoolSnap HQ, Photometrics, 
or Retiga Exi, QImaging). The microscope was equipped with traditional polar- 
ization optics, a DIC module, a fluorescence imaging module and 2D LC-Polscope 
microscopy’®. For 3D PolScope measurements, we used a Zeiss Axiovert 200M 
microscope with a Plan Apochomat oil-immersion objective (X63; numerical 
aperture, 1.4) and a monochrome CCD camera” (Retiga 4000R, QImaging). 

Sample temperature was tuned between 4 and 60 °C with a home-made Peltier 
module equipped with a proportional-integral—derivative temperature controller 
(ILX Lightwave LPT 5910). The temperature-controlling side of the Peltier device 
was attached to a copper ring fitted around the microscope objective, which heats 
or cools the sample through the immersion oil. A thermistor, placed in the copper 
ring adjacent to the sample, enabled the proportional-integral-derivative feed- 
back necessary to adjust the temperature. Excess heat was removed using a con- 
stant flow of room-temperature water. 
3D LC-PolScope microscopy. Membrane edges with left-handed and right- 
handed twists appear indistinguishable when viewed with 2D LC-PolScope 
because this technique provides only a 2D projection of the sample birefringence. 
To determine the 3D orientation of rods, we used 3D LC-PolScope microscopy”®. 
The 3D LC-Polscope extends the capabilities of the 2D LC-PolScope by placing a 
microlens array in the image plane of the objective lens. The microlens array 
generates a hybrid image consisting of a series of conoscopic microimages, each 
probing a different spatial area in the image plane. Analysis of each conoscopic 
microimage reveals the local birefringence as a function of the propagation direction 
of transmitted light rays. Supplementary Fig. 4a illustrates the path of the optical rays 
from the specimen plane to the back focal plane of the 3D LC-PolScope microlens 
array. Rays shown in green and red pass perpendicularly through the membrane, 
parallel to its optical axis, and experience no differential retardation. Rays shown in 
black pass at an angle to the optical axis and are differentially retarded. Rays with the 
same tilt angle in the specimen plane are focused at the same point in the back focal 
plane of the objective. The array of microlenses, placed in the image plane of the 
objective, refocuses these rays into a plane behind the microlens array. Behind each 
microlens, an image of the objective’s aperture appears containing multiple cono- 
scopic images, each specific to rays that have passed through a small region of the 
specimen. For example, rays emanating from the region X of the sample fall onto the 
microlens X’. Therefore, only rays that passed through X contribute to the cono- 
scopic image behind X’. The same argument applies to rays passing through Y and 
Y’, and so on. 

The conoscopic image of a single microlens focused on the bulk portion of the 
membrane shows a radially symmetric distribution of the retardance that increases 
away from the centre of the image (Supplementary Fig. 4b). In the centre, the rays 
of light are aligned with the orientation of the viruses and the retardance is zero. 
Away from the centre, the retardance increases because the rays of light become 
tilted with respect to the orientation of the viruses. When the viruses are tilted with 
respect to the optical axis, for example near the edge of the membrane, the radial 
profile obtained from a conoscopic image becomes asymmetric and indicates local 
tilting of the molecules. For colloidal membranes composed of an achiral mixture 
of wild-type and Tyr21Met fd viruses, the viruses at the edge of a membrane are 
found to twist in clockwise and anticlockwise directions with equal probability 
(Supplementary Fig 4c-e). 

Analysis of the 3D LC-PolScope retardance profile is used to determine the 
concentration of viruses within a membrane, because the membrane birefringence 
is proportional to the local rod concentration. It follows that the local retardance of 
the membrane is given by R = AngatdcSsin?(0)/sin(«), where 0 is the local tilting of 
the rods, c is the concentration of rods within a membrane, d is the thickness of the 
colloidal membrane, S is the local order parameter (assumed to be ~1), Ariat is the 
specific birefringence of a fully aligned bulk sample of fd virus at unit concentra- 
tion as determined by X-ray scattering experiments” and « is the orientation of the 
rays of light as a function of the distance, r, to the centre of the conoscopic image. 
This orientation is given by sin(«) = rNA/rnan, where rya = 3.9 um is the size of 
the conoscopic image, NA = 1.4 is the numerical aperture and n = 1.33 is the index 
of refraction of the solvent. The fitting of this functional form to the retardance 
profile yields a mean concentration of viruses in the bulk of the membrane of 
c=103+10mgml | (Supplementary Fig. 4f). Similar values where obtained by 
an independent method that involves counting the number of fluorescently labelled 
particles'’. Finally, we also note that the concentration of the viruses does not 
appreciably change with varying Dextran concentration (Supplementary Fig. 4g). 
Measurement of the line tension. Line tension was measured by directly imaging 
and analysing thermal fluctuations of the membrane’s edge with DIC optical 
microscopy using well-established methods'*. Lower line tension indicates that 
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less energy is required to place the particles at an interface, resulting in larger 
fluctuations of the membrane’s edge. The conformation of the membrane’s edge is 
described by h(x), the local height of the interface, where x is the lateral position 
along the interface. The total length of the interface is 


dh\* 
L= | dxy/1 —— 1 

Jaya) o 

For small fluctuations (dh/dx = tan(0) ~ 0), the expression for the excess length of 
the edge relative to a flat interface can be approximated as 

1 

AL= ex 140-15 | dx 6G) 

If line tension, y, is the energetic cost per unit length of the edge, the free-energy 


difference between fluctuating and straight edges is given by AF = yAL. Next we 
decompose @(x) into a Fourier series: 


O(x) = - [ay cos (qx) 


Here L is the length of the interface and the wavevector, q, is given by q = nn/L. 
The energetic cost of the membrane conformation specified by Fourier amplitudes 
ag is given by 


L a 
ary as 24) = 2 2 
AF=yAL z] exo a 


By invoking the equipartition theorem, it is possible to relate the line tension to 
fluctuations of the interface: 


(a2) =kyT/y (2) 


To measure (a,”) experimentally, we acquired a series of uncorrelated images of 
the interfacial contour. To ensure optimal contrast, the DIC shear axis was always 
perpendicular to the membrane’s edge. Under these conditions, intensity profile 
cuts taken perpendicular to the edge could be fitted to a Gaussian, yielding the 
conformation of the edge with subpixel accuracy. Each conformation is described 
in terms of the Fourier amplitudes, aj. Averaging over a sufficient number of 
uncorrelated images yields a fluctuation spectrum as shown in Fig. 2b, where 
the mean square amplitudes, (a,”), are plotted as functions of q. In the limit of 
small q, (a,’) is independent of wavenumber as predicted by equation (2). As q 
increases, the fluctuations are suppressed and scale as q*. The entire fluctuation 
spectrum can be quantitatively described by the following equation: 
a (3) 


yng 


The 1/q° regime can be obtained by including terms that are proportional to the 
square of the gradient of 0(x) in equation (1). This indicates that there is a bending 
energy, Kk, associated with fluctuations of the interface, which might be related to 
the existence of a thin, quasi-1D nematic phase located at the edge of the mem- 
brane, where the rods are tilted and the layer’s smectic ordering is destroyed. 
Laser tweezers. Colloidal membranes and ribbons are easily manipulated using 
optical tweezers. The laser tweezer set-up was built around an inverted Nikon TE- 
2000 microscope. A 1,064-nm laser beam (Compass 1064, Coherent) is time- 
shared between two points by means ofa pair of orthogonally oriented paratellurite 
(TeO3) acousto-optic deflectors (Intra-Action). The laser is projected onto the 
back focal plane of an oil-immersion objective (Plan Fluor 100, NA = 1.3) and 
subsequently focused onto the imaging plane. Using custom LABVIEW software, 
multiple trap locations were specified and used to stretch and manipulate mem- 
branes and ribbons. 
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Electron microscopy. For transmission electron microscopy, membranes were 
assembled as described above but with the following differences. The monolayer 
membranes are relatively small and only stable in solution; therefore, to keep them 
intact throughout fixation and staining, we immobilized the membrane assemblies 
by embedding them in 1% low-melting-temperature agarose (SeaKem; gelation 
temperature, 25 +5 °C), in 100mM NaCl, 20mM Tris, pH 8.15, and 150mM 
sucrose as a cryoprotectant. Virus samples were mixed in 1.5-ml Eppendorf tubes 
with fluid agarose at 30 °C and kept warm for 3-6 h to allow formation of relatively 
large membranes as confirmed by light microscopy. Subsequently, a drop of the 
fluid sample containing membranes was transferred into the 0.1-mm-high cavity 
of an aluminium planchette for high-pressure freezing (Wohlwend), which was 
then closed. Then the sample temperature was decreased to 4 °C for 30-45 min. 
The gelled samples were rapidly frozen using a Leica HPM-100 high-pressure 
freezer (Leica Microsystems). Using a Leica AFS-2 device, the frozen samples were 
freeze-substituted at low temperatures (starting at —90 °C) over the course of three 
days in a solution containing 1% osmium tetroxide (EMS), 0.5% anhydrous 
glutaraldehyde (EMS) and 2% water in anhydrous acetone (AC32680-0010, 
Fisher Scientific). After the temperature was raised to 4 °C, the sample was infil- 
trated and embedded in EMbed 812-Resin (EMS). Ultrathin, 70-nm, sections were 
collected on slot grids covered with Formvar support film and post-stained with 
uranyl acetate (supersaturated solution) and 0.2% lead citrate, before inspection 
on a FEI Morgan 268 transmission electron microscope with a 1k CCD camera 
(GATAN) and on a 300-keV Tecnai F30 intermediate-voltage transmission elec- 
tron microscope (FEI) with a 4k CCD camera (GATAN). For electron tomo- 
graphy, we used 150-nm-thick sections that were coated with 10-nm colloidal 
gold particles. Tilt series were recorded over an angular range of +60° with 1° 
increments on the Tecnai F30 using the microscope control software 
SERIALEM”*. Tomographic reconstructions were calculated with the IMOD soft- 
ware using fiducial alignment and weighted back-projection”’. 

Computer simulations of the rod/polymer mixture. In the computational 
model, the rods were represented as hard spherocylinders and the non-adsorbing 
polymers (depletant) were modelled as ghost spheres, which act as hard spheres 
when interacting with rods but can freely interpenetrate one another**. We per- 
formed Metropolis Monte Carlo simulation with periodic boundary conditions. 
The system contained 6,000 rods in a box with dimensions 140 X 140 X 50 (in 
units of the rod diameter). A constant osmotic pressure was maintained by coup- 
ling ghost spheres to a constant-chemical-potential bath through insertion and 
deletion moves. The simulations thus sampled the N,oallsphereVT ensemble, where 
Nyoa is the number of rods, [sphere is the sphere chemical potential and V is the 
volume. The rod aspect ratio was 20, the ghost-sphere diameter was 1.5 and on 
average there were about 520,000 spheres. Simulations were initialized with rods in 
a monolayer, and Monte Carlo simulation was performed until rod density within 
the membrane and rod orientations equilibrated. Rods at the membrane edge 
twisted spontaneously regardless of the initial rod orientations in the membrane. 
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Kimberlite ascent by assimilation-fuelled buoyancy 


James K. Russell!, Lucy A. Porritt!, Yan Lavallée? & Donald B. Dingwell* 


Kimberlite magmas have the deepest origin of all terrestrial magmas 
and are exclusively associated with cratons'*. During ascent, they 
travel through about 150 kilometres of cratonic mantle lithosphere 
and entrain seemingly prohibitive loads (more than 25 per cent by 
volume) of mantle-derived xenoliths and xenocrysts (including 
diamond)**. Kimberlite magmas also reputedly have higher ascent 
rates®° than other xenolith-bearing magmas’®"’. Exsolution of dis- 
solved volatiles (carbon dioxide and water) is thought to be essential 
to provide sufficient buoyancy for the rapid ascent of these dense, 
crystal-rich magmas. The cause and nature of such exsolution, 
however, remains elusive and is rarely specified®’. Here we use a series 
of high-temperature experiments to demonstrate a mechanism for 
the spontaneous, efficient and continuous production of this volatile 
phase. This mechanism requires parental melts of kimberlite to ori- 
ginate as carbonatite-like melts. In transit through the mantle litho- 
sphere, these silica-undersaturated melts assimilate mantle minerals, 
especially orthopyroxene, driving the melt to more silicic composi- 
tions, and causing a marked drop in carbon dioxide solubility. The 
solubility drop manifests itself immediately in a continuous and vig- 
orous exsolution of a fluid phase, thereby reducing magma density, 
increasing buoyancy, and driving the rapid and accelerating ascent of 
the increasingly kimberlitic magma. Our model provides an explana- 
tion for continuous ascent of magmas laden with high volumes of 
dense mantle cargo, an explanation for the chemical diversity of 
kimberlite, and a connection between kimberlites and cratons. 

Our current understanding of kimberlite ascent is greatly hampered 
by uncertainty about the compositions of primary kimberlite melts'**"”. 
These compositions remains elusive for three reasons: kimberlites 
contain abundant (>30%) xenocrystic material; kimberlite melt 
quenched to glass has not been observed; and kimberlites are highly 
susceptible to alteration owing to their ultrabasic composition’ and 
age’. Many strategies have been used to estimate kimberlite melt com- 
positions’*'* but the range of postulated compositions remains large 
and contentious (SiO, < 25-35%, MgO 15-25%)—especially the con- 
tent of dissolved volatiles (CO, and H,O)'*?*"’. Physical properties 
estimated for the putative range of melt compositions include densities 
similar to basaltic or komatiitic melts (2,800-2,900 kg m °) and low 
viscosities (10 7-10* Pas)”. 

Mantle xenoliths transported by kimberlite constrain the formation 
depths of these magmas to the base of the cratonic mantle lithosphere 
(CML) or deeper*. Mechanical disaggregation of xenoliths produces 
the mantle-olivine-dominated (>25 vol.%) suite of xenocrysts char- 
acteristic of most kimberlites (Fig. 1a). Notably, orthopyroxene (opx), 
which comprises ~15-27% of the CML'*”* and is the most silicic 
mineral present (Fig. 1b), is rare-to-absent in kimberlite. Where 
observed, opx texturally records severe disequilibrium (Fig. 1c)*'”"*. 
The high crystal contents suggest bulk densities of the kimberlite 
magma (melt+ crystal) in the range of 3.0-3.1 gcm’ *. Buoyant ascent 
of kimberlite magma from depths >120km can be greatly enhanced 
by the presence of a low-density exsolved fluid phase (density of CO, 
fluid peo, ~ 1.2 gcm ° at pressure P ~ 2 GPa). 

Here we present a mechanism for the rapid ascent of these 
deep-seated, high-density magmas. Our model is inspired by new 


Figure 1 | Transmitted-light optical microscope images of thin sections of 
kimberlite, peridotite and orthopyroxene. a, Kimberlite hosts abundant 
rounded centimetre- to millimetre-scale mantle-derived xenocrysts of olivine 
(ol) and subordinate amounts of xenocrystic clinopyroxene (cpx), garnet (gar) 
and ilmenite (ilm). Orthopyroxene (opx) is conspicuously absent. b, Peridotitic 
xenolith from kimberlite, showing original mantle-equilibrated mineralogy 
(ol > opx > cpx > gar), textures, and grain size and shape distributions of 
mantle minerals. Grains of opx are marked with ‘+’ (image courtesy of M. 
Kopylova). c, Kimberlite containing a fragment of disaggregated mantle 
peridotite (arrow), comprising a grain of partially dissolved opx (+; grey grain) 
within larger ol grain (yellow grain) (image courtesy of C. Brett). 
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Figure 2 | CO, solubilities in silicic to carbonate melts (after ref. 19). a, CO, 
solubility limits for silica-saturated (S) and undersaturated (US) melts and 
hypothetical solubilities of carbonatite and kimberlite melt. b, Pressure and 
composition dependence of CO, solubility across the carbonate-silicate 
transition. Note limited effects of pressure (numbers on lines, MPa) on CO, 
solubility versus the strong effects of composition (that is, SiO. + ALO; wt%). 
c, Schematic model for assimilation-induced fluid-exsolution (that is, 
vesiculation) of carbonatitic or proto-kimberlitic melts. Opx assimilation drives 
non-silicate melts (left side) to more silicic compositions (towards right side), 
causing a decrease in CO) solubility and exsolution ofa fluid phase. Continued 
assimilation causes continued exsolution, and results in a kimberlitic melt and 
olivine saturation at about 18-32 wt% SiOg. 


experimental data on the solubility of volatiles (that is, CO2) in melts 
across the silicate-carbonate transition” (Fig. 2). This work elucidates 
the pressure and compositional controls on CO, solubility in silicic 
(for example, basalt to melilite) to carbonatitic melts (Fig. 2a,b) and 
provides three main insights. First, the solubility of CO (and H20) 
in silicic melts increases with pressure but, importantly, in magmas 
with SiO, + Al,O; > 35 wt%, solubility is limited to ~10-15%. This 
precludes extraordinary amounts of volatile being sequestered in 
parental kimberlitic melts’”°. Second, carbonatitic melts (that is, 
SiO, + Al,O3 < 20 wt%) have substantially higher CO, solubilities 
that are essentially independent of pressure (Fig. 2b). Solubility of 
CO, in carbonate melts is limited only by melt stoichiometry, and 
can be >40% (refs 21, 22). Last, the transition from carbonatitic to 
silicic (that is, kimberlitic) melt compositions is accompanied by a 
pronounced decrease in CO solubility (Fig. 2b). 

We suggest that melts parental to kimberlite originate as carbonatitic 
or near-carbonatitic melts. Such melts begin with near-stoichiometric 
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CO, contents (~40 wt%) and are buoyant relative to the cratonic mantle 
lithosphere (density p ~ 2.6-2.8 gcm *)”. Carbonatitic melts can also 
accommodate substantial amounts of water (10 wt% at 0.1 GPa) without 
reducing their CO, contents”. The melts entering the CML initiate 
the mechanical incorporation and disaggregation of xenoliths®**, 
thereby ensuring that newly liberated xenocrysts are continuously fed 
into the melt. At these pressures (P<2.5GPa) and temperatures 
(T> 1,250 °C), all mantle minerals are out of equilibrium and react 
with the low-agio, carbonatitic melt” (asin, is the activity of SiO;). 
Orthopyroxene, as the most silica-saturated phase, has the highest 
affinity for dissolution and is assimilated rapidly and preferentially 
over other phases*’”?>*”, 

Assimilation of opx drives the melt towards more silicic com- 
positions (Fig. 2c). The increased SiO, content of the contaminated 
carbonatitic melt causes a decrease in CO) solubility that is expressed 
by immediate fluid exsolution deep within the CML. Continued opx 
assimilation is attended by continuous fluid exsolution, leading to 
ever-increasing buoyancy. The process is gradual until opx assimila- 
tion produces a ‘kimberlitic melt composition (SiO, > 18 wt%), which 
initiates a catastrophic drop in CO) solubility. This results in massive 
exsolution of a volatile phase that reduces magma density, increases 
buoyancy, and supports rapid and accelerating ascent. 

Results from a series of high-temperature melting experiments test 
these inductive ideas, and illustrate the potential speed and efficiency of 
the process. Our experiments involve melting powdered mixtures of 
Na,CO; and natural mantle-derived opx (MgSiO3; Fig. 3a), and mea- 
suring the weight loss as a proxy for CO, loss. The procedures and 
resulting data are fully described in Methods and Supplementary 
Table 1. 

In non-steady-state (‘transient’) experiments, the melt rapidly 
assimilates the opx, creating a more silicic melt that is oversaturated 
in dissolved CO}. The melt exsolves dissolved CO) (that is, undergoes 
decarbonation), causing a weight loss proportional to the opx content 
(Fig. 3b). These experiments demonstrate the spontaneous nature and 
rapid (~20 min) rate of the assimilation and decarbonation reactions. 
Reaction (that is, weight loss) continues until all of the opx has dis- 
solved, the melt has reached its maximum SiO, content, and the CO, 
solubility limit of the new more silicic melt is reached (Fig. 3b). Static 
experiments track the change in melt composition and the production 
of CO, via melt decarbonation (Fig. 3c,d). The SiO. and MgO contents 
of the hybrid melts are greater than the bulk compositions (Fig. 3a) 
owing to enrichment by CO, loss. Na,O content shows an initial 
increase and then decreases owing to the combined effects of dilution 
(addition of opx) and enrichment (decarbonation). CO, decreases 
dramatically. The addition of ~42 wt% opx produces SiO)-rich 
(~32.7 wt%) melts in which all CO, is lost due to decarbonation 
(Fig. 3d). The maximum amount of CO, that would be released from 
the magma during assimilation to fuel the ascent of kimberlite is ~25- 
30 wt% (Fig. 3c, d). 

Primary mantle-derived carbonatitic to near-carbonatitic (that is, 
low-dsio,) melts provide an appealing model for parental kimberlite 
magma. Carbonatite melts are accepted as a common product of partial 
melting of carbonated mantle sources at pressures >2.5 GPa (refs 21, 
28). Recent experimental work shows that the solidus melts of carbo- 
nated peridotite at >2.5 GPa will always be CO rich and SiO poor as 
long as carbonate is stable in the mantle assemblage’'. These experi- 
mentally produced melts contain in excess of 40% dissolved CO,, and 
can even accommodate substantial dissolved HO (ref. 23); we suggest 
that the asthenospheric production of such melts marks the onset of 
kimberlite ascent (Fig. 4a, b). Our mechanistic model for kimberlite 
ascent (Fig. 4b, c) supports petrogenetic models that interpret the 
diversity of kimberlite bulk rock compositions as mechanical mixing 
of mantle olivine (70-80%) and a carbonate-rich melt” combined with 
assimilation of mantle orthopyroxene (20-30%)*”. 

We modelled the chemical evolution of the ascending melt by 
assuming that the amount of opx assimilated is linearly related to the 
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Figure 3 | High-temperature analogue experiments. a, Compositions 
produced by mechanical mixing of sodium carbonate (Na,CO3) and 
orthopyroxene (opx, MgSiO3). (Data given in Supplementary Table 1). Shaded 
field (K) denotes range of SiOz contents for kimberlite melts based on Igwisi 
Hills kimberlite (Supplementary Table 2). b, Results of transient high- 
temperature experiments, showing mass change with time due to 


distance travelled in the mantle lithosphere, and that assimilation and 
decarbonation are essentially instantaneous (left dashed line, Fig. 4d; 
Supplementary Table 2). This process can be sustained energetically by 
the adiabatic decompression of the magma during its ascent, which can 
result in considerable (150-200 °C) superheat*’. After assimilation of 
~ 15-20% opx, a carbonatite melt will evolve to a composition close to 
melt compositions estimated for the Igwisi Hills kimberlite lavas** 
(SiO, 16-22 wt%; Supplementary Table 2; Fig. 4d). The amount of 
CO, released during assimilation and available to fuel the ascent of 
the contaminated carbonatite (kimberlite in the widest sense) is 
~25%, Ultimately, continued opx assimilation drives the melt towards 
olivine saturation (Fig. 2c), allowing for late-stage magmatic crystal- 
lization of olivine as overgrowths observed on rounded olivine xeno- 
crysts or as microphenocrysts*!**°”* (Fig. 2c). Olivine crystallization 
would logically coincide with the later stages of kimberlite ascent and 
emplacement, and clearly follows on from an earlier and more extens- 
ive period of olivine disequilibrium. 

The assimilation of opx is the key to this process, because it dissolves 
on timescales that compete with kimberlite ascent rates**””. The rapid 
and efficient dissolution of opx, relative to other mantle xenocrysts, 
explains the absence of xenocrystic opx in kimberlite*'’. However, the 
main mechanistic consequence of opx assimilation is to decrease the 
CO, solubility of the melt, thereby driving deep-seated exsolution and 
effervescence of a CO;-rich volatile phase (Fig. 4c). The exsolved fluid 
enhances the buoyancy of the ascending magma within the deep man- 
tle, and provides an explanation for its rapid ascent and its ability to 
continuously entrain and transport mantle cargo to the Earth’s surface. 

Our model for kimberlite ascent is unique in that it begins with a 
low-dsio, melt containing stoichiometric amounts of dissolved CO3. 
The CO, is stable within the melt across a wide range of pressure- 
temperature conditions and, thus, exsolution of the fluid phase is not 
driven by depressurization, as it is in most magma ascent models. 
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decarbonation driven by opx assimilation. Total weight loss (4) is reported in 
parentheses. c, Melt compositions produced by chemical reaction during 
melting of Na,CO3-opx mixtures versus opx content. Shaded fields mark 
kimberlite (K) melts and condition for olivine saturation and crystallization. 
d, Post-experiment distribution of CO, between hybrid melt and the exsolved 
fluid phase. Distance between two lines is total exsolved (released) COp. 


Rather, CO, fluid exsolution is driven by an unavoidable chemical 
reaction (that is, assimilation) that creates a new more silicic melt 
whose CO, solubility is exceeded; the exsolution is spontaneous and 
immediate. Although the initial fluid production is not driven by a 
change in pressure, the buoyancy of the fluid-saturated magma will be 
greatly enhanced by the volumetric expansion of the fluids attending 
depressurization. This expansion allows for continuous acceleration of 
the magma, entrainment of greater quantities of mantle material, and 
decoupling (that is, separation) of a CO,-rich fluid phase from the 
ascending magma to create precursory fenitization (alkali metasoma- 
tism) events*®. 

Our model provides an explanation for the linkage between 
kimberlites and cratons; it is clear that, if the parental magmas are 
carbonatitic, that a ready source of highly reactive opx is necessary to 
produce a more silica-rich melt (that is, kimberlite) that will exsolve 
fluid in the deep mantle. The cratonic mantle lithosphere has two 
attributes that support this transformation: first, it is enriched in opx 
(15-30%) relative to other mantle lithosphere'*"*; and second, it is 
thick (90-120 km), and thus provides ample opportunity for sampling 
of opx-rich mantle. As long as the evolving carbonatitic melt is exposed 
to new opx, the assimilation-induced exsolution of CO) will continue 
and buoyancy is maintained or increased. Therefore, we might expect 
the diversity of kimberlite at the Earth’s surface to actually reflect the 
mineralogical composition, thermal state and the thickness of the 
underlying mantle lithosphere rather than the source region of the melt. 

The corollary is that if opx were absent, this process would not be 
viable, because the dissolution rates for other mantle silicates (which 
could also promote an assimilation-driven exsolution of fluid) are too 
low to keep pace with ascent rate. In addition, the enthalpy of opx 
dissolution is less than the heat of crystallization for olivine, implying 
that opx assimilation can be balanced energetically by smaller quantities 
of olivine crystallization and a little cooling'*. Dissolution of olivine, 
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Figure 4 | Model for assimilation-fuelled ascent of kimberlite. a, Diverse 
ascent paths through cratonic mantle lithosphere (CML) shown as dashed 
arrows. Also shown is the line below which diamond is stable relative to 
graphite. b, Melts produced by melting of carbonated peridotite transit mantle 
lithosphere as dykes by crack-tip propagation, liberating dense (sinking) 
xenoliths to the CO,-rich silica-undersaturated melt, and causing effervescence 
of buoyant (rising) CO}-fluid. c, Xenoliths disaggregate and release individual 
mineral grains (for example, ol) to carbonatitic melt; opx grains are assimilated, 
preferentially promoting volatile exsolution. Deep-seated volatile production 
supports continued, crack-propagation-limited magma ascent. d, Chemical 
evolution of melt during ascent, including: left panel, wt% of opx assimilated 
(dashed line, dissolved), and CO, content in bulk system and in melt (solid 
lines); right panel, oxide (SiO2, MgO + FeO, CaO) contents of the evolving melt 
(solid lines), and the 100 Ca/(Ca + Mg) ratio of melt (dashed line). Initial 
carbonatitic and derivative kimberlite melts (shaded boxes) defined by SiO, 
contents (Supplementary Table 2). 


for example, would require substantial cooling of the magma (as well 
as longer times). In parts of the Earth where carbonatitic magmas do 
not transit the cratonic mantle lithosphere, there are only two possible 
outcomes. The carbonatitic melt successfully travels through a rela- 
tively thin lithosphere to be emplaced or erupted as carbonatite, or the 
carbonatite melt incorporates opx-poor mantle and crystallizes at 
depth because it fails to assimilate mantle silicates efficiently enough 
to promote a deep-seated fluid phase and the requisite buoyancy. 
We suggest the possibility that all kimberlites start life as carbona- 
titic melts produced through partial melting of carbonated peridotite 
in the deep subcratonic mantle. As the melts transit the mantle litho- 
sphere they continuously incorporate and disaggregate peridotite 
xenoliths. The low dsio, of the carbonatitic melt causes the rapid 
and preferential dissolution of opx relative to the other silicate mantle 
minerals. This increases the silica content of the melt and triggers 
immediate exsolution of CO, reducing the buoyancy of the magma 
and facilitating rapid ascent. The faster the magma rises, the more 
mantle material will be entrained and the more opx will be dissolved. 
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Providing opx is available, this mechanism enables the continuous and 
accelerating ascent of the magma and the evolution of the melt from 
carbonatitic to kimberlitic compositions. 


METHODS SUMMARY 


Our experiments use synthetic melts of Na,CO3 (melting temperature 
Tm = 851 °C) as an analogue for a natural carbonatitic-like melt. The experiments 
involve the melting of mixtures of NazCO; and crushed natural opx at tempera- 
tures of 1,000-1,100 °C for ~1 h; the mixture is open to the air. Pure NayCO; melt 
contains ~44 wt% dissolved CO, and is stable at these conditions. Melting of 
Na,CO3-opx mixtures produces vigorous decarbonation expressed by weight loss. 
Decarbonation results from reduced CO, solubility in the hybrid melt produced by 
opx assimilation (Fig. 2). Both transient and static experiments were performed. In 
the former, an initial mass of sample (NayCO; + opx) of known composition was 
lowered from a balance into a preheated furnace; cumulative weight loss was 
recorded as a function of time. The transient experiment on pure Na,CO3 
(>1h at 1,100 °C) shows only slight (<1.8%) change in mass and demonstrates 
the stability of stoichiometric amounts of CO) in the melt. Transient melting of 
Na,CO3-opx mixtures causes immediate reaction as opx is assimilated; the result- 
ing hybrid melts exsolve CO} causing a continuous but decreasing loss in weight, 
until the equilibrium solubility limit of the new melt composition is reached, and 
weight loss ceases (<20 min). The magnitude of weight loss is taken as a quant- 
itative measure of CO, loss. The tangents to the weight loss curves give the relative 
rates of assimilation and degassing. Static experiments saw known masses and 
compositions of NayCO3 + opx placed into a pre-heated oven (1,000-1,100 °C) 
for a fixed time (Supplementary Table 1; Fig. 3a). The weight loss at the end of each 
experiment was measured to provide a quantitative estimate of decarbonation. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Experiments. We have used a series of high-temperature melting experiments to 
illustrate aspects of the proposed model for the ascent of kimberlite. Our experi- 
ments use synthetic melts of sodium carbonate as an analogue for a natural 
carbonatitic melt that could be parental (that is, pre-assimilation) to kimberlites. 
The experimental results provide a semiquantitative test of the ideas advanced in 
our manuscript. The experiments (for example, conditions, melt compositions, 
and scales) are not intended to mimic the natural process exactly, but to illustrate 
what is possible and how the process may operate during magma transport. 

Our experimental design follows ref. 34, in which was explored the melting 
reactions between superliquidus sodium carbonate (Na,COs) and silica sand. Pure 
Na2CO3; melt contains ~44 wt% (50 mol%) dissolved CO. Fusion of the mech- 
anical mixtures of Na,CO ; and silica sand at temperatures above melting 
(Tm > 851°C) caused the NazCO3 melt to react vigorously with the silica grains 
and exsolve CO, to the atmosphere**. The magnitude of CO, loss was quantita- 
tively expressed as a loss in weight and trivial weight loss due to Na2O volatilization 
was demonstrated**.The decarbonation of the melts continues until the SiO, 
contents of the hybrid melts exceed the stoichiometry of sodium orthosilicate 
(Na4SiO, ~ 32.7 wt%) causing crystallization of the silicate phase. 

Our experiments used Na,CO; melts at superliquidus temperatures of 1,000- 
1,100°C into which natural orthopyroxene (opx, MgSiO3) had been added 
(Supplementary Table 1). The high superliquidus temperatures were chosen to 
ensure complete reaction and to suppress crystallization of new oxide or silicate 
minerals and are not necessarily applicable to the natural process (that is, 
kimberlite ascent). The opx derives from a fresh sample of cratonic mantle 
peridotite from the Jericho kimberlite, NWT, Canada’. The sample was coarsely 
crushed and opx grains were hand-picked under a binocular microscope; an 
aliquot of >150g of orthopyroxene grains was then washed, dried and finely 
crushed to a coarse powder (<500 lm). We then prepared batches (15-20 g) of 
pre-mixed starting materials comprising precisely measured masses of oven dried 
NazCO3 powder and coarsely powdered opx; the mass fractions of the batches 
covered a range of opx contents from 0 to 40 wt% (Supplementary Table 1). The 
preparation of large batches of starting mixtures facilitated performing replicate 
experiments on the same bulk compositions. 

Two types of experiments were performed. First, we performed transient 
experiments wherein a known mass of sample (NaxCO; + opx) of known com- 
position (0, 25 and 40 wt% opx) was placed in a platinum crucible and lowered 
from an electronic balance into a preheated furnace. The experiments were held at 
the controlled furnace temperature (1,000-1,100 °C) for 1-1.5 h. Each experiment 
provides a record of total change in mass of the crucible plus sample as a function 
of time. The transient experimental set-up was also used on a pure NayCO; melt in 
which no opx was introduced (0% opx, Supplementary Table 1). This experiment 
(Fig. 3b) demonstrates the long-term stability of stoichiometric amounts of dis- 
solved CO, (~40 wt%) in sodium carbonate melts. Over the course of the experi- 
ment (>1h at 1,100 °C) there is only a slight (<1.8%) cumulative change in mass, 
indicating that, even though performed in air, there is little to no dissociation of the 
melt or degassing. This stability of the NayCO3 melt is also demonstrated by the 
static experiments (Supplementary Table 1, see below). 

In the other transient experiments, the NaxCO; powder melts and begins to 
react immediately with the mechanically mixed-in grains of opx. The Na,CO3 
melts assimilate the opx powder creating a more silicic melt oversaturated in 
dissolved CO. The melt is forced to exsolve (effervesce) the dissolved CO, (that 
is, decarbonation), causing a continuous loss in weight (16% and 21%; Fig. 3b). 
The gas exsolution reaction continues until the solubility limit of the new melt 
composition is reached and the loss of weight with time ceases. 

Interpretation of results. The transient experiments provide two insights. First, 
the final weight loss is a measure of the total CO} lost and defines the residual CO2 
in the melt; that value represents the new equilibrium CO, solubility of the hybrid 
melt. Second, the tangent to the mass loss versus time curve is a record of the 
relative rates of assimilation and degassing. The slope and implied rates are high 
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initially and then decrease exponentially as the equilibrium solubility limit for the 
new hybrid melt is reached. The transient experiments show the spontaneous 
nature and rapid rate of the assimilation and decarbonation reactions; virtually 
all of the loss in mass occurs in the first <20 min, after which there is little change, 
indicating that the assimilation-induced decarbonation process is over. The 
experiments rapidly converge to an equilibrium state wherein all of the opx has 
been dissolved, the melt has reached its maximum SiOz content and the CO; 
solubility has been decreased as far as possible. At this point, the melt retains 
the amount of CO, dictated by its composition; the addition of more opx would 
foster more reaction, more weight loss, and a new more silicic melt further 
depleted in COs. 

We also performed static experiments wherein a known mass of sample 

(Na,CO; + opx) of known composition (0, 25 and 40 wt% opx) was measured 
into a platinum crucible, weighed, and then placed within a pre-heated oven 
(1,000-1,100 °C) for a fixed time (>35-75min) (Supplementary Table 1; 
Fig. 3a). The sample was then removed, cooled in a desiccator, weighed and the 
total weight loss determined. Most experiments were run in duplicate or triplicate 
and several duplicates are reported in Supplementary Table 1. 
Data analysis. All hybrid melt compositions are computed assuming that all 
available opx dissolves into carbonate melt without crystallization to create a more 
siliceous melt (Supplementary Table 1). Each experiment was examined visually 
for signs of crystallization immediately on being removed from the furnace at the 
end of the experimental dwell time. In every case, the crucibles contained only a 
clear translucent liquid. The residual CO, content of the melt is also calculated 
assuming all mass loss is due to decarbonation (that is, Na2O volatilization is 
negligible**) and establishes the solubility limits of the hybrid melts. 

These experimental results are used to predict the chemical evolution of an 
idealized carbonatitic melt ascending through the mantle lithosphere and pref- 
erentially assimilating opx (Fig. 4d; Supplementary Table 2). These model calcula- 
tions use the following end-member compositions and relationships. We use as 
the parental carbonatitic melt composition KM29 from ref. 36, a melt composition 
produced by melting (1,430°C, 7 GPa) of carbonated peridotite. The opx is 
assumed to have a single composition equivalent to that reported in sample 25- 
9 of ref. 35.We have modelled the CO, content (in wt%) of the evolving melt (W,) 
using the experimentally derived (Fig. 4d) expression: 


We= Wil — 2.17 Wop) (1) 


where W,; is the CO, content (in wt%) of the idealized parental carbonatitic melt, 
and Wopx is the weight fraction of opx assimilated. 

The range of melt compositions generated in this fashion are reported in 
Supplementary Table 2, where they are compared against the melt compositions 
estimated for the Igwisi Hills kimberlite**. The Igwisi Hills compositions include 
two measurements on matrix-rich portions of kimberlite lava and two model 
compositions produced by correcting one of the lava compositions for the com- 
positional effects of xenocrystic and phenocrystic olivine. For the purposes of 
comparison, Supplementary Table 2 reports the Igwisi Hills compositions on a 
reduced basis wherein MgO represents the sum of MgO and FeO. The best agree- 
ment between our model melt compositions resulting from progressive assimila- 
tion of opx and the Igwisi Hills kimberlite melt estimates occurs at between 15 and 
20 wt% assimilation. 


34. Hrma, P. Reaction between sodium carbonate and silica sand at 874°C < T 
1022°C. J. Am. Ceram. Soc. 68, 337-341 (1985). 

35. Kopylova, M. G., Russell, J. K. & Cookenboo, H. Petrology of peridotite and 
pyroxenite xenoliths from the Jericho kimberlite: implications for the thermal state 
of the mantle beneath the Slave Craton, Northern Canada. J. Petrol. 40, 79-104 
(1999). 

36. Dalton, J.A.& Presnall, D.C. Carbonatitic melts along solidus of model lherzolite in 
the system CaO-—MgO-Al203-Si02-CO>» from 3 to 7 GPa. Contrib. Mineral. Petrol. 
131, 123-135 (1998). 
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Recovery rates reflect distance to a tipping point in a 


living system 


Annelies J. Veraart', Elisabeth J. Faassen', Vasilis Dakos', Egbert H. van Nes!, Miquel Lirling’? & Marten Scheffer! 


Tipping points, at which complex systems can shift abruptly 
from one state to another, are notoriously difficult to predict’. 
Theory proposes that early warning signals may be based on the 
phenomenon that recovery rates from small perturbations should 
tend to zero when approaching a tipping point **; however, evidence 
that this happens in living systems is lacking. Here we test such 
‘critical slowing down’ using a microcosm in which photo-inhibition 
drives a cyanobacterial population to a classical tipping point when a 
critical light level is exceeded. We show that over a large range of 
conditions, recovery from small perturbations becomes slower as the 
system comes closer to the critical point. In addition, autocorrelation 
in the subtle fluctuations of the system’s state rose towards the 
tipping point, supporting the idea that this metric can be used as 
an indirect indicator of slowing down*’. Although stochasticity 
prohibits prediction of the timing of critical transitions, our results 
suggest that indicators of slowing down may be used to rank com- 
plex systems on a broad scale from resilient to fragile. 

Systems ranging from the brain and society to ecosystems and the 
climate can have tipping points at which minor perturbations can invoke 
a critical transition to a contrasting state’. The complexity of such sys- 
tems prohibits accurate predictive modelling. However, it has been sug- 
gested that even without mechanistic insight, the proximity of a tipping 
point may be inferred from generic features of fluctuations and spatial 
patterns that can be interpreted as early warning indicators'”’. This idea 
is based on the phenomenon that at bifurcation points at which stability 
of an equilibrium changes, the dominant real eigenvalue becomes zero”’. 
As a result, the rate of recovery from perturbation should go to zero as 
such bifurcations are approached (Supplementary Notes 1). This phe- 
nomenon, which is known as critical slowing down, is well established 
in physics but it has only recently been suggested that the recovery rate 
from perturbations could be an indicator of the distance to a tipping 
point in complex living systems such as ecosystems’. 

Although the prospect that the fragility of living systems could be 
probed this way is attractive, experimental evidence has so far been 
lacking. Instead, much work has been focused on ways to infer slowing 
down from indirect indicators such as autocorrelation and variance. 
However, although these indicators are linked to slowing down in 
simple stochastically forced models’*”, recent theoretical studies indi- 
cate that the indirect indicators will not always respond in simple 
ways" (Brock, W. A. & Carpenter, S. R., submitted). This is confirmed 
by empirical studies on the climate’, the food web of a lake’* and 
laboratory populations of water fleas'*. In these systems, trends in 
indirect indicators occurred but were not all consistent. Here we use 
acontrolled system in which there is positive feedback between organisms 
and their physical environment to test critical slowing down directly 
from recovery rates. 

We exposed cyanobacteria in chemostat microcosms to increasing 
light stress. This is a well understood system for which models have 
shown alternative stable states and tipping points’*'*. Cyanobacteria 
provide the shade needed for their own growth, creating a positive 
feedback, and this constitutes the mechanism behind the bistability. 


Although light is needed for photosynthesis, light levels that are too 
high are detrimental to primary producers such as the cyanobacteria 
that we used. Mutual shading can ameliorate this stress, and is thus one 
of the ways in which facilitation can outweigh competitive interactions 
under harsh conditions’®. Such feedback between organisms and their 
environment is the mechanism behind alternative stable states in a 
range of ecosystems”. Indeed, as a result of the facilitative shading, our 
system can maintain a high biomass under incident light levels that 
prohibit growth in low biomass cultures. A fold bifurcation that repre- 
sents a classical tipping point occurs at the light level at which this 
mechanism becomes too weak to allow persistence of the population’** 
(see Supplementary Notes 1 for a model analysis). Here, shading 
becomes insufficient to prevent growth inhibition, and the resulting 
loss of biomass further weakens the stabilizing shading effect. This 
implies that a positive feedback is driving the system towards a crash. 

We cultured cyanobacteria in two independently controlled chemostat 
microcosms (M1 and M2) and increased incident light daily in small 
steps to the point at which the population collapsed (see Methods). We 
perturbed the populations every 4-5 days by removing 10% of their 
biomass through dilution. Consistent with the model results (Sup- 
plementary Fig. 1.1) the populations maintained a relatively high 
biomass throughout the experiment until they collapsed rather 
markedly when a tipping point was reached (Fig. 1 and Supplemen- 
tary Notes 3). Recovery rates of both systems decreased gradually 


Collapse 


Microcosm 1 
Microcosm 2 


Light attenuation coefficient (m~) 


320 


Time (days) 


Figure 1 | The response of two populations of cyanobacteria 
(Aphanizomenon flos-aquae) to dilution events under a regime of gradually 
increasing light levels. Dilution events are indicated as perturbations p1—p6. 
The light attenuation coefficient is a measure of population density. Thin curve 
segments represent the baselines that were used for computing recovery rates. 
The inset shows the experimental system. 
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Figure 2 | Indicators of slowing down as a function of light intensity. 

a, Recovery rates after perturbation (pl-p6). b, Photosynthetic efficiency 
(photosystem II quantum yield). c, Autocorrelation in the population density 
estimator for each day based on 30 min average [,,, data. d, Corrected variance 
in daily time series (see Methods and Supplementary Notes 4). Fisher’s 
combined (one-tailed) significance test of the slopes of the regression line’s 
(n = 6) in both microcosms: recovery rate, 7°44 = 18.48, P = 0.001 (if the p6 
perturbations are excluded (n = 5), 04 = 12.39, P = 0.015); photosynthetic 
efficiency n = 45, 74 = 147.66, P< 0.001; autocorrelation, n = 23, 774 = 14.00, 
P= 0.007; variance, not significant, n= 23, 4 = 7.72, P= 0.102. 


towards the tipping point, starting far from the bifurcation (Fig. 2a), 
and tended to decline more rapidly towards the tipping point. This was 
also predicted by our model (Supplementary Fig. 1.1). 

In most complex systems, the mechanisms that are involved in 
causing the slowing down will be difficult to unravel. However, in 
our particular system the photosynthetic capacity of the cyanobacteria 
that is at the heart of their growth potential can be sensed through 
measurements of the efficiency of their light harvesting system (see 
Methods). Whereas biomass remained relatively high in the trajectory 
towards the critical threshold, this specific indicator of their vigour 
declined linearly with increasing light stress, approaching zero at the 
point of collapse (Fig. 2b). This is an independent confirmation that 
the light-induced stress to the cyanobacteria in the two independently 
operated microcosms does indeed undermine the resilience of the 
system to the point at which collapse is inevitable. 

In systems that are subject to stochastic perturbation regimes, slow- 
ing down is predicted to be reflected in characteristic changes in fluc- 
tuations of the state. In particular, it has been proposed that increases 
in autocorrelation and variance can be interpreted as indirect indica- 
tors of slowing down'*”. Although our experiment was not primarily 
designed to study the effect of stochastic perturbation regimes, there 
are continuous subtle fluctuations in our measurements of the density 
of the cyanobacterial population. These fluctuations will reflect a mixture 
of factors including measurement noise and instabilities in the lighting 
system as well as true population fluctuations induced by the slightly 
fluctuating conditions in the bubbled microcosms. We studied how 
autocorrelation and variance in these subtle fluctuations changed as 
our system approached the critical point. For this we analysed all the 
stretches of continuous measurements of 1 day between the experi- 
mental interruptions caused by the daily stepwise increase of light 
intensity and the dilution perturbation events (see Methods). 
Although autocorrelation was quite variable in the time series that 
we studied, there was a significant increase towards the tipping point 
in both experimental systems (Fig. 2c). No trend in variance was 
apparent in the systems (Fig. 2d, see also Supplementary Notes 4). 
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These results are consistent with the prediction that measuring recovery 
rates from perturbations is a robust way to detect critical slowing 
down", whereas indirect indicators of slowing down may or may not 
increase towards a critical threshold'!. Also, our findings are consistent 
with the prediction that autocorrelation is usually directly related to 
slowing down and may therefore provide a more robust signal than 
variance in some situations’. 

Perturbation experiments such as the ones in our experiment will 
often be impossible in large complex systems, leaving indirect indica- 
tors as the only tool by which to infer slowing down. However, experi- 
mental probing of recovery rates may be feasible in some smaller 
systems, as long as the timescales are appropriate and stochastic fluc- 
tuations are small relative to experimental perturbations. Even in larger 
systems, local perturbations may be an option to probe resilience, 
allowing adaptive management to steer the system away from the brink 
of collapse. 

Perhaps most importantly, our experimental demonstration of slow- 
ing down implies a proof of concept, providing a fundamental basis to the 
current search for generic early warning signals in systems ranging from 
the brain and ecosystems to society and the climate’. The fact that slowing 
down in our system started far from the critical point suggests that 
recovery rates as well as indirect indicators may be used to rank such 
complex systems on a broad scale from stable to critical. This does not 
mean that slowing down can be used to actually predict transitions. 
Stochastic shocks will trigger critical transitions always before the bifurca- 
tion point is reached, indicating that there is inherent unpredictability in 
systems. Nevertheless, the prospect of having generic indicators of 
resilience is a potentially large step forward. Mechanistic models to 
predict tipping points in nature and society accurately are simply 
beyond our reach, leaving empirical estimation of fragility as one of 
the key challenges in complex systems science today’. 


METHODS SUMMARY 


Experiments were performed in two identical flat chemostat microcosms’? (M1 
and M2) in which we cultured cyanobacteria (Aphanizomenon flos-aquae (L.) 
Ralfs) on a nutrient-rich growth medium, modified from BG11 medium”. Light 
irradiance was increased in steps of 23 umol photons m *s’' for M1 and 29 mol 
photons m~? s~! each day for M2. Photosynthetic efficiency was measured from 
diurnal samples. The intensity of the light passing through the chemostats was 
averaged at 5-min intervals, and light attenuation coefficients (¢, m') were cal- 
culated as an indicator of biomass: 


ec = In(Tout/Tin) 
— d 


where J, is the intensity of the incoming light (WE), Iout is the intensity of the 
outgoing light (UE) and d is the depth of the chemostats (m). External perturba- 
tions were performed every 4-5 days by diluting the culture with 170 ml of sterile 
medium. A baseline for calculating recovery rates was constructed for each per- 
turbation event (Supplementary Notes 2 and Supplementary Fig. 2.1) by fitting a 
quadratic curve from the period just before perturbation to the period just before 
the next perturbation (thin curves in Fig. 1). Recovery rates after each perturbation 
(A, per day) were then calculated from a linear regression of -In(é — &,.) against 
time, where ép is the light attenuation of the baseline (m') and ¢, is the light 
attenuation of the chemostat (m7 ')?. 

The lag 1 autocorrelation and variance were analysed for each uninterrupted 
period between the daily manipulations after removing the trends from each period 
by fitting polynomials of 2 degrees. To check for effects of nonlinear propagation of 
measurement noise, we constructed a null model that assumed that all residuals 
were due to uncorrelated normally distributed noise. As variance showed a trend 
towards the bifurcation in this null model (Supplementary Table 4.1), we corrected 
the observed variance by subtracting the median of the null model (Fig. 2d). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Experimental conditions. Experiments were performed in two identical flat 
chemostats (V 1.7 1, 0.05 m optical path length)’’. In these chemostats we cultured 
the cyanobacterium Aphanizomenon flos-aquae (L.) Ralfs on a nutrient-rich sterile 
growth medium that was modified from BG11 medium”. The chemostats were 
kept at a stable temperature of 21 °C. A continuous flow of moistened air of 60-100 
ml min ' was supplied through a sintered glass sieve at the bottom of the vessel to 
ensure homogenous mixing of the culture. The air was mixed with CO, to satisfy 
the inorganic carbon need of the culture. CO, flow was adjusted, when needed, to 
maintain a pH of between 7.1 and 8.1. The chemostats were run at a dilution rate of 
0.18 per day for chemostat 1 (named M1) and 0.21 per day for chemostat 2 and 
(M2). They were illuminated using white LED lamps (SL3500w, Photon Systems 
Instruments). Light irradiance was increased by 23 mol photons m *s__' per day 
for M1 and 29 umol photons m * s~’ per day for M2 by a Light Studio 1.3 12C 
interface (Photon Systems Instruments, Brno). 

Daily maintenance and measurements. Each day the walls of the chemostats were 
scraped with a magnetic stirrer to prevent cyanobacterial attachment. After scraping, 
we took samples to determine chlorophyll a concentrations and photosystem II 
quantum yield (in triplicate) using a PhytoPAM (phyto-ED), and to determine 
biovolume in a 400-11 sample volume (in triplicate) using a Casy TT Cell Counter, 
with a 150-1m capillary (Innovatis AG Casy Technology). The intensity of the light 
penetrating through the chemostat was recorded continuously using RA100 light 
sensors (Bottemanne Weather Instruments) that were attached to the outer wall of 
the chemostat and stored as 5-min averages on Squirell SQ1000 dataloggers (Grant 
Instruments). The light sensors were removed during scraping of the chemostats. 
Perturbations. At an incoming light intensity of 477 umol photons m~*s_' for 
M1 and 571 for M2, the light attenuation coefficients of the chemostats became 
stable. From this moment on, perturbations were performed every 4-5 days by 
diluting the culture with 170 ml of sterile medium. The dilution was always per- 
formed 2 h after the daily stepwise increase in light. 

Recovery rates. We used the calculated light attenuation (Fig. 1 and 
Supplementary Note 2) as a measure of the cyanobacterial biomass’!. Before 
calculation of recovery rates, the light data were corrected for sensor attachment 
differences (Supplementary Note 2). Vertical light attenuation (¢, m_') was cal- 
culated from the corrected light data: 


ites In(Iout/Tin) 
é= a 


where J;,, is the intensity of the incoming light, [ou is the intensity of the outgoing 
light (both measured in jumol photons m~*s_ ') and dis the optical path length of 
the chemostats (m). Light attenuation data were smoothed for calculation of 
recovery rates by taking a moving average of 1h. 

To calculate recovery rates, a baseline was constructed for each perturbation 
event. This baseline was obtained by fitting a quadratic curve from the period just 
before perturbation to the period just before the next perturbation (Fig. 1). 
Parameters for the baseline were estimated by forcing it through the sets of ¢ 
and t (time, day) at the start and end of the curve, and by forcing the slope at 
the start of the curve. The start slope was determined by the slope of the light 
attenuation data in the 20h before disturbance. 

Recovery rate after perturbation (A, per day) was defined by an exponential 
model’: 


where é is the light attenuation coefficient of the baseline and «¢, is the light 
attenuation of the chemostat (both m‘!). We calculated by a linear regression 
of — In(é9 —é.) against time. To avoid the effect of the light data correction for 
sensor position (Supplementary Note 2), recovery rates were calculated only on 
the first 18-20h after perturbation. In this period there was no change in light 
meter position. Finally, the recovery rates were linearly regressed against incoming 
light intensity. 
Autocorrelation and variance. Autocorrelation and variance of the continuous 
small fluctuations in our time series were analysed for each uninterrupted period 
between the daily manipulations and light increments. We performed all analyses 
on untransformed data (5-min averages of light attenuation data) as well as on data 
that were averaged over non-overlapping periods of 30 min. We removed the 
trends from each period with a constant light level by fitting polynomials of 2 
degrees to the light attenuation, and we used the residuals to calculate the auto- 
correlation by fitting an autoregressive model of lag 1 and variance by estimating 
sample variance per day. 

We analysed the effect of measurement noise using a null model (see 
Supplementary Notes 4). 


21. Huisman, J. & Weissing, F. J. Light-limited growth and competition for light in 
well-mixed aquatic environments: an elementary model. Ecology 75, 507-520 
(1994). 
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Evolution of increased complexity in a molecular 


machine 


Gregory C. Finnigan'*, Victor Hanson-Smith”**, Tom H. Stevens! & Joseph W. Thornton”*"° 


Many cellular processes are carried out by molecular ‘machines’ — 
assemblies of multiple differentiated proteins that physically inter- 
act to execute biological functions’ *. Despite much speculation, 
strong evidence of the mechanisms by which these assemblies 
evolved is lacking. Here we use ancestral gene resurrection” ' and 
manipulative genetic experiments to determine how the complexity 
of an essential molecular machine—the hexameric transmembrane 
ring of the eukaryotic V-ATPase proton pump— increased hundreds 
of millions of years ago. We show that the ring of Fungi, which is 
composed of three paralogous proteins, evolved from a more ancient 
two-paralogue complex because of a gene duplication that was 
followed by loss in each daughter copy of specific interfaces by 
which it interacts with other ring proteins. These losses were com- 
plementary, so both copies became obligate components with 
restricted spatial roles in the complex. Reintroducing a single 
historical mutation from each paralogue lineage into the resurrected 
ancestral proteins is sufficient to recapitulate their asymmetric 
degeneration and trigger the requirement for the more elaborate 
three-component ring. Our experiments show that increased com- 
plexity in an essential molecular machine evolved because of simple, 
high-probability evolutionary processes, without the apparent 
evolution of novel functions. They point to a plausible mechanism 
for the evolution of complexity in other multi-paralogue protein 
complexes. 


Comparative genomic approaches suggest that the components of 
many molecular machines have appeared sequentially during evolu- 
tion and that complexity increased gradually by incorporating new 
parts into simpler assemblies**. Such horizontal analyses of extant 
systems, however, cannot decisively test these hypotheses or reveal 
the mechanisms by which additional parts became obligate com- 
ponents of larger complexes. In contrast, vertical approaches that 
combine computational phylogenetic analysis with gene synthesis 
and molecular assays allow changes in the sequence, structure and 
function of reconstructed ancestral proteins to be experimentally 
traced through time." Here we apply this approach to characterize 
the evolution of a small molecular machine and dissect the mechan- 
isms that caused it to increase in complexity. 

The vacuolar H*-ATPase (V-ATPase) is a multisubunit protein 
complex that pumps protons across membranes to acidify subcellular 
compartments; this function is required for intracellular protein traf- 
ficking, coupled transport of small molecules and receptor-mediated 
endocytosis’. V-ATPase dysfunction has been implicated in human 
osteoporosis, in acquired drug resistance in human tumours, and in 
pathogen virulence'**. A key subcomplex of the V-ATPase is the Vo 
protein ring, a hexameric assembly that uses a rotary mechanism to 
move protons across organelle membranes (Fig. 1a)'*'°. Although the 
V-ATPase is found in all eukaryotes, the Vo ring varies in subunit 
composition among lineages. In animals and most other eukaryotes, 
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Figure 1 | Structure and evolution of the V-ATPase complex. a, In S. 
cerevisiae, the V-ATPase contains two subcomplexes: the octameric V, domain is 
on the cytosolic side of the organelle membrane, and the hexameric Vp ring is 
membrane bound. Protein subunits Vma3, Vmal1 and Vmal6 are labelled and 
coloured. b, Maximum likelihood phylogeny of V-ATPase subunits Vma3, 
Vmal1 and Vmal6. All eukaryotes contain subunits 3 and 16, but Fungi also 


contain subunit 11. Circles show ancestral proteins reconstructed in this study. 
Colours correspond to those of subunits in panel a; unduplicated orthologues of 
Vma3 and Vmal1 are green. Asterisks show approximate likelihood ratios for 
major nodes: ****, >10%; ***, >107; **, >10; *, <10; ~, <2. The complete 
phylogeny is presented in Supplementary Information, section 2. 
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the ring consists of one subunit of Vma16 protein and five copies of its 
paralogue, Vma3 (Fig. 1b)'. In Fungi, the ring consists of one Vmal6 
subunit, four copies of Vma3 and one Vma11 subunit, arranged in a 
specific orientation’’. All three proteins are required for V-ATPase 
function in Fungi'*”, but the mechanisms are unknown by which 
both Vma3 and Vmall became obligate components with specific 
positional roles in the complex. 

To understand how the three-component ring evolved, we recon- 
structed ancestral Vo proteins from just before and after the increase in 
complexity, synthesized and functionally characterized them in a yeast 
genetic system, and used manipulative methods to identify the genetic 
and molecular mechanisms by which their functions changed. We first 
inferred the phylogeny and best-fit evolutionary model of the protein 
family of which Vma3, Vmall and Vmal6 are members, using the 
sequences ofall 139 extant family members available in GenBank (Sup- 
plementary Table 1). The maximum likelihood phylogeny (Fig. 1b and 
Supplementary Information, section 2) indicates that Vma3 and 
Vmal1 are sister proteins that were produced by duplication of an 
ancestral gene (Anc.3-11) before the last common ancestor of all 
Fungi (~800 million years ago’). Whether this duplication occurred 
before or after the divergence of Fungi from other eukaryotes (~1 
billion years ago”°) is not clearly resolved, although the latter scenario 
is more parsimonious. The Vma3/Vma11 and Vmal6 lineages, in turn, 
descend from an older gene duplication deep in the eukaryotic lineage 
(Fig. 1b). We used a maximum likelihood algorithm”' to infer the 
ancestral amino acid sequences with the highest probability of pro- 
ducing all the extant sequence data, given the best-fit phylogeny and 
model. We reconstructed the ancestral proteins (Anc.3-11 and Anc.16) 
that made up the ancient two-paralogue eukaryotic ring, as well as the 
duplicated subunits Anc.3 and Anc.11 from the three-component ring 
in the common ancestor of all Fungi (Supplementary Information, 
sections 3 and 4). 

To characterize the functions of these reconstructed proteins, we syn- 
thesized coding sequences and transformed them into Saccharomyces 
cerevisiae deficient for various ring components and therefore incapable 
of growth in the presence of elevated CaCl, (ref. 22). We found that the 
ancestral two-subunit ring can functionally replace the three-subunit 
ring of extant yeast. When the resurrected Anc.3-11 was transformed 
into yeast deficient for Vma3 (vma3A) or Vmal1(vma11A), growth in 
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Figure 2 | Two reconstructed ancestral Vy subunits functionally replace the 
three-paralogue ring in extant yeast. S. cerevisiae were plated in decreasing 
concentrations on permissive medium (YEPD) buffered with elevated CaCl). 
a, Expression of Anc.3-11 rescues growth in yeast that are deficient for 
endogenous subunit Vma3 (34), subunit Vmal1 (114) or both (34 114). 
Growth of wild-type (WT) yeast is shown for comparison. b, Anc.16 rescues 
growth in yeast that are deficient for subunit Vmal6 (164). c, Expression of 
Anc.3-11 and Anc.16 together rescues growth in yeast that are deficient for 
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the presence of elevated CaCl, was rescued, indicating that the func- 
tions of the present-day Vma3 and Vma11 proteins were already pre- 
sent before the duplication that generated them (Fig. 2a). Furthermore, 
Anc.3-11—unlike either of its present-day descendants—can partially 
rescue growth in yeast that are doubly deficient for both Vma3 and 
Vmall (vma34 vmal11A). The reconstructed Anc.16 also rescued 
growth in Vmal6-deficient S. cerevisiae (vmal6A) (Fig. 2b), and co- 
expression of Anc.3-11 and Anc.16 together rescued cell growth in 
vma3A vmal1A vmal6A yeast, which lack all three ring subunits 
(Fig. 2c). The ancestral genes specifically restore proper V-ATPase 
function in acidification of the vacuolar lumen (Fig. 2g). In addition, 
mutation of the ancestral subunits to remove glutamic acid residues 
known to be essential for V-ATPase enzyme function'’”* abolished 
their ability to rescue growth on CaCl, (Supplementary Information, 
section 7). These inferences about the functions of Anc.3-11 and Anc.16 
are robust to uncertainty about ancestral amino acid states. We recon- 
structed alternative versions of Anc.3-11 and Anc.16 by introducing 
amino acid states with posterior probability >0.2, but none of these 
abolished the ability of the ancestral genes to substitute functionally for 
the extant subunits (Supplementary Information, section 8). These 
results establish that during the increase in complexity, neither the Vo 
complex nor its component proteins evolved new functions required 
for growth under the conditions in which the ring is known to be 
important. 

Similar experiments with the components of the ancestral three- 
component ring show that after the duplication of Anc.3-11, its 
descendants Anc.3 and Anc.11 both became necessary for a functional 
complex because of complementary losses of ancestral functions. 
Unlike Anc.3-11, expression of Anc.3 can rescue growth and vacuole 
acidification in vma3A but not vma11A yeast, and Anc.11 can rescue 
growth in vma11A but not vma3A yeast (Fig. 2d, e, g). Furthermore, 
both Anc.3 and Anc.11 are required to rescue growth fully in 
vma3A vma11A yeast (Fig. 2f). These data indicate that after its origin 
by gene duplication, Anc.11 lost the ancestral protein’s ability to carry 
out at least some functions of Vma3, and Anc.3 lost the ancestral 
capacity to carry out those of Vmal1. 

We conjectured that Vma3 and Vmal1 evolved their specialized 
roles because they lost specific interfaces present in their ancestor that 
are required for ring assembly. Previous experiments with fusions of 


Genotype YEPD CaCl, g Quinacrine DIC 
wt | 
se) WT 
3A 
3A 
3A 3A 
174 (oe - 
= A + 
H@ @ Ole Anc.3-11 
A> a 
16A + 
3A Anc.16 : 
114 
11A + a 
8A11A — 
8A11A 3A+ 
Anc.3 
8A 11A 


Vma3, Vmall1 and Vmal6. d, Anc.11 rescues growth in vma11A but not in 
vma3A yeast. e, Anc.3 rescues growth in vma3J but not vma11A yeast. f, Anc.3 
and Anc.11 together rescue growth in vma34 vma11A mutants. 

g, Yeast expressing reconstructed ancestral subunits properly acidified the 
vacuolar lumen. Red signal shows yeast cell walls; green signal (quinacrine) 
shows acidified compartments. Yeast were visualized by differential 
interference contrast microscopy. 
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extant yeast proteins have shown that the arrangement of subunits in 
the ring is constrained by the capacity of each subunit to form specific 
interfaces (which we labelled P, Q and R) with the other subunits*. 
Specifically, Vma11 is restricted to a single position between Vmal6 
and Vma3, because its clockwise interface can participate only in 
interface R with Vma16, and its anticlockswise interface can participate 
only in interface P with the clockwise side of Vma3 (Fig. 3). By contrast, 
copies of Vma3 occupy several positions in the ring, because they form 
interface P with other copies of Vma3 or Vma11, as well as interface Q 
with Vmal6. However, Vma3 cannot form interface R with Vmal6. As 
a result, both Vma3 and Vma11 are required in extant yeast to form a 
complete ring with Vmal6. 

To determine whether interaction interfaces were lost during evolu- 
tion, we engineered fusions of ancestral ring proteins to assess the 
capacity of each to form the specific interfaces with other subunits 
that are required for a functional complex. Because Anc.3-11 can 
complement the loss of both Vma3 and Vmal1, we proposed that 
the Anc.3-11 subunit could participate in all three specific interaction 
interfaces, and that these capacities were then partitioned between 
Anc3 and Ancl1 after the duplication of Anc.3-11 (Fig. 3a, b). To test 
this hypothesis, we created six reciprocal gene fusions between yeast 
subunit Vmal6 and ancestral subunits Anc.3-11, Anc.3 and Anc.11 
(Fig. 3c and Supplementary Information, section 9). Each fusion con- 
strains the structural position of subunits relative to subunit Vmal6, 
making it possible to determine which arrangements yield a functional 
ring. As predicted, Anc.3-11 functioned on either side of Vmal6 
(Fig. 3d), indicating that it could form all three interfaces P, Q and 
R. By contrast, Anc.3 functioned when constrained to participate in 
interface Q with Vmal6 and interface P with Vma3; however, ring 
function was lost when Anc.3 was constrained to form interface R with 
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Figure 3 | Increasing complexity by complementary loss of interactions in 
the fungal Vo ring. a, Model of the ancestral three-paralogue ring, arranged as 
in extant yeast**. Unique intersubunit interfaces are labelled P, Q and R. Subunits 
are colour-coded as in Fig. 1. b, Model of the ancestral two-paralogue ring, before 
duplication of Anc.3-11. ¢, To constrain the location of specific subunits, gene 
fusions were constructed by tethering an ancestral subunit to either the amino- 
or carboxy-terminal side of yeast Vmal6. Roman numerals indicate the 
locations of transmembrane helices (I, II, HI, IV and V)*. d-f, Growth assays of 
yeast with fused Vo subunits identify the interfaces that ancestral subunits can 
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Vmal6 (Fig. 3e). Anc.11 functioned when constrained to participate in 
interface R with Vma16 and interface P with Vma3, but ring function 
was lost when Anc.11 was constrained to participate in interface Q 
with Vmal6 and interface P with Vma3. This result indicates that 
Anc.11 lost the capacity to form one or both of these interfaces during 
its post-duplication divergence from Anc.3-11 (Fig. 3f). 

Taken together, these data indicate that the specificity of the ring 
arrangement and the obligate roles of Vma3 and Vmal1 evolved by 
complementary loss of asymmetric interactions with other members 
of the ring (Fig. 3g, h). Before Anc.3-11 duplicated, the protein ring 
contained copies of only undifferentiated subunit Vma3/Vmal1 and 
subunit 16. Immediately after Anc.3-11 duplicated, the two descend- 
ant subunits must have been functionally identical, so the protein ring 
could have assembled with many possible combinations of the two 
descendants, including copies of only one of the descendant proteins. 
This flexibility disappeared when Anc.3 lost the ancestral interface that 
allowed it to interact with the anticlockwise side of Vmal6, and Anc.11 
lost the ability to interact with the clockwise side of Vmal6 and/or the 
anticlockwise side of Vma3. These complementary losses are sufficient 
to explain the specific arrangement of contemporary subunits in 
reconstructed and present-day fungal transmembrane rings. 

To establish the genetic basis for the partitioning of the functions of 
Anc.3-11 between Vma3 and Vmal1, we introduced historical muta- 
tions into Anc3.11 by directed mutagenesis and determined whether 
they recapitulated the shifts in function that occurred during the evolu- 
tion of Anc.3 and Anc.11. The two phylogenetic branches leading from 
Anc.3-11 to Anc.3 and to Anc.11 contain 25 and 31 amino acid sub- 
stitutions, respectively, but only a subset of these are strongly con- 
served in subunits Vma3 or Vmal1 from extant Fungi (Fig. 4a). We 
introduced each of these ‘diagnostic’ substitutions into Anc.3-11 and 
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form. For each experiment, expressed Vo subunits are listed. Tethered subunits 
are in brackets and connected by a thick line. Cartoons show the constrained 
location of the tethered subunit relative to Vmal6. Anc.3-11 can function on 
either side of Vmalé6 (d). Anc.3 can function only on the clockwise side of 
Vmal6 (e). Anc.11 can function only on the anticlockwise side of Sc.16 

(f). g, Interfaces that are formed by Vo subunits before and after duplication and 
complementary loss of interfaces, based on the data in panels d-f. Red crosses 
indicate lost interfaces. h, Schematic of interfaces formed by Anc.3-11 that were 
lost in Anc.3 and Anc.11, based on data in panels d-f. 
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Figure 4 | Genetic basis for functional differentiation of Anc.3 and Anc.11. 
a, Experimental analysis of historical amino acid replacements. The table lists 
replacements that occurred on the branches leading from Anc.3-11 to Anc.11 
(yellow) or to Anc.3 (blue) and that were subsequently conserved. Each derived 
residue was introduced singly into Anc.3-11; the variant genes were 

transformed into S. cerevisiae, and growth was assayed on elevated CaCl). The 


experimentally evaluated whether they recapitulated the loss by Anc.3 
or Anc.11 of the capacity to complement Vma gene deletions. We 
found that a single amino-acid replacement that occurred on the 
branch leading to Anc.11 (V15F) abolished the capacity of Anc.3-11 
to function as subunit 3; it also enhanced the ability of Anc.3-11 to 
function as subunit 11 (Fig. 4b). V15F is located in transmembrane 
helix I, which participates in the P interface that our experiments 
indicate may have been lost on the same branch (Fig. 3 and 
Supplementary Information, section 4). Conversely, a single historical 
replacement (M22I) on the branch leading to Anc.3 radically reduced 
the capacity of Anc.3-11 to function as subunit 11 (Fig. 4c). M22I is 
also in transmembrane helix I, which participates in formation of the R 
interface that was lost on this branch (Fig. 3 and Supplementary 
Information, section 4). The Anc.3-11 M22I mutant retains some of 
the capacity of the ancestral protein to rescue growth in the Vmal1- 
deficient background, suggesting that other mutations also contributed 
to the functional evolution of Vma3. One other historical mutation 
(N88T) on this branch also impaired the capacity of Anc.3-11 to func- 
tion as subunit 11, but it reduced the capacity of the protein to function 
as Vma3 as well, suggesting that epistatic interactions with other residues 
allow this mutation to be tolerated in Anc.3 and its descendants. Several 
of the replacements on the branch leading to Anc.11 show a similar 
pattern, reducing the capacity of the protein to replace Vma3, indi- 
cating that these historical replacements function better together than 
in isolation. 

How complexity and novel functions evolve has been a longstand- 
ing question in evolutionary biology’, because mutations that com- 
promise existing functions are far more frequent than those that 
generate new ones”. Our results indicate that the architectural com- 
plexity of molecular assemblies can evolve because of a few simple, 
relatively high-probability mutations that degrade ancestral interfaces 
but leave other functions intact. The specific roles of subunits Vma3 
and Vmal1 seem to have been acquired when duplicated genes lost 
some, but not all, of the capacity of the ancestral protein to participate 
in interactions with copies of itself and another protein required for 
proper ring assembly. Because complementary losses occurred in both 
lineages, the two descendant subunits became obligate components, 
and the complexity of the ring increased. It is possible that specialization 
of the duplicated subunits allowed increases in fitness, but genome-wide 
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table shows growth semiquantitatively from zero (none) to wild type 
(++++++). Bold mutations entirely or partly recapitulate the functional 
evolution of Anc.11 and Anc.3. b, Replacement V15F abolishes the capacity of 
Anc.3-11 to function as subunit 3 and enhances the capacity of Anc.3-11 to 
function as subunit 11. c, Replacement M22] impairs the capacity of Anc.3-11 
to function as subunit 11 without affecting its capacity to function as subunit 3. 


interaction screens and the phenotype of vma11A yeast provide no 
evidence that Vma11 evolved novel functions in addition to those that 
it inherited from Anc.3-11 in the Vo ring”. 

We are aware of no other mechanistic analyses of a molecular 
machine’s evolutionary trajectory, so the generality of our observations 
is unknown. By definition, however, all molecular machines involve 
differentiated parts in specific spatial orientations, and many such 
complexes are entirely or partially composed of paralogous proteins” *. 
In the evolution of any such assembly, additional paralogues could 
become obligate components because of gene duplication” and sub- 
sequent mutations that cause specific interaction interfaces among 
them to degenerate. 

This view of the evolution of molecular machines is related to recent 
models that explain other biological phenomena—such as the reten- 
tion of large numbers of duplicate genes and mobile genetic elements 
within genomes—as the product of degenerative processes acting on 
modular biological systems’’. Although mutations that enhanced the 
functions of individual ring components may have occurred during 
evolution, our data indicate that simple degenerative mutations are 
sufficient to explain the historical increase in complexity of a crucial 
molecular machine. There is no need to invoke the acquisition of 
‘novel’ functions caused by low-probability mutational combinations. 


METHODS SUMMARY 


Ancestral protein sequences were inferred using maximum_-likelihood phylogenetics 
from an alignment of 139 protein sequences of extant subunits 3, 11 and 16 from 
Amoebozoa, Apicomplexa, Metazoa, Choanoflagellida and Fungi. Ancestral genes 
were synthesized, cloned into yeast expression vectors and tested for complementa- 
tion in various S. cerevisiae mutants. V-ATPase function was assayed by growth tests 
on medium buffered with CaCl, as described previously*’. Steady-state levels of 
Vph1 were determined by western blot. Quinacrine staining and Vph1-GFP (green 
fluorescent protein) fusion constructs were visualized by fluorescence microscopy. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


In silico reconstruction of ancestral protein sequences. V,) complex subunits 
Vma3, Vmall and Vmal6 are sometimes referred to as subunits c, c’ and c” in the 
literature. We searched GenBank for all eukaryote V-ATPase Vo ring sequences 
(Supplementary Information, section 1). Our query returned subunit 3, 11 and 16 
protein sequences for 26 species in Fungi, and subunit 3 and 11 sequences for 35 
species in Metazoa, Amoebozoa and Apicomplexa. We aligned the sequences 
using PRANK v0.081202 (refs 32, 33). We selected the best-fit model (WAG with 
gamma-distributed rate variation and a proportion of invariant sites) using the 
Akaike Information Criterion as implemented in PROTTEST****. With this 
model, we used PhyML v3.0 to infer the maximum likelihood topology, branch 
lengths and model parameters**. We optimized the topology using the best result 
from nearest-neighbour interchange and subtree pruning and regrafting; we 
optimized all other free parameters using the default hill-climbing algorithm in 
PhyML. Phylogenetic support was calculated as the approximate likelihood ratio 
(converted from the approximate likelihood ratio statistic (aLRS) for branches 
reported by PhyML, using the equation aLR = exp[aLRS/2]) and as the likelihood 
ratio-based SH-like branch supports*’. Nematoda subunit 3 and 11 sequences 
were connected by a very long branch basal to the Chromalveolata lineages. 
This result is inconsistent with the expectation that Nematoda are animals**, so 
we excluded Nematoda data from further downstream analysis. 

We inferred ML ancestral states and posterior probability distributions at each 

site for all ancestral nodes in the ML phylogeny using our own set of Python scripts, 
called Lazarus, which wraps PAML version 4.1 (ref. 39). Lazarus parsimoniously 
places ancestral gap characters according to Fitch’s algorithm”. We characterized 
the overall support for Anc.3-11, Anc.16, Anc.3 and Anc.11 by binning the 
posterior probability of the ML state at each site into 5%-sized bins and then 
counting the proportion of total sites within each bin (Supplementary Informa- 
tion, section 2). 
Robustness to alignment uncertainty. To assess the robustness of ancestral 
reconstructions to alignment uncertainty, we performed alignment using four 
algorithms: CLUSTAL version 2.0.10 (ref. 41), MUSCLE v3.7 (ref. 42), AMAP 
v2.2 (ref. 43), and PRANK v0.081202 (refs 32, 33). We then inferred the ML 
phylogeny and branch lengths for each alignment, using the methods described 
above. The resultant alignments varied in length from 347 sites (using CLUSTAL) 
to 683 sites (using PRANK), but all four alignments yielded the same ML topology 
with nearly identical ML branch lengths. 

To determine which alignment algorithm yields the most accurate ancestral infer- 
ences under V-ATPase phylogenetic conditions, we simulated sequences across the 
V-ATPase ML phylogeny using insertion and deletion rates ranging from 0.0 to 0.1 
indels per site. For each indel rate, we generated ten random unique indel-free 
ancestral sequences of 400 amino acids in length and then used INdelible** to 
simulate the ancestral sequence evolving along the branches of our ML phylogeny 
under the conditions of the WAG model with a proportion of invariant sites (+1) 
and a discrete gamma distribution of evolutionary rates (+G) with indel events 
randomly injected according to the specified indel rate. The size of each indel event 
was drawn from a Zipfian distribution with coefficient equal to 1.1 and the maximum 
length limited to 10 amino acids. We aligned the descendant sequences of each 
replicate using AMAP, CLUSTAL, MUSCLE and PRANK. For each alignment, 
we inferred the ML topology, branch lengths and model parameters using the 
methods described above. We used Lazarus to reconstruct all of the ancestral states, 
and queried Lazarus for the most-recent shared ancestor for opisthokont subunit 
3/11 and opisthokont subunit 16 sequences. We measured the error of ancestral 
reconstructions as the proportion of ancestral sites that incorrectly contained an 
indel character (see Supplementary Information, section 6). 

Plasmids and yeast strains. Bacterial and yeast manipulations were performed 
using standard laboratory protocols for molecular biology**. Plasmids that were 
used are listed in Supplementary Information, section 5. Ancestral sequences 
(pGF140, pGF139, pGF506 and pGF508) were synthesized by GenScript with a 
yeast codon bias. Triple haemagglutinin epitope tags were included before each 
stop codon. The Anc.3-11, Anc.16, Anc.3 and Anc.11 genes were subcloned to 
single-copy, CEN-based yeast vectors. The ADH terminator sequence (247 base 
pairs (bp)) and Nat’ drug resistance marker*® were amplified using polymerase- 
chain-reaction (PCR) containing 40-bp tails homologous to the 3’ end of each 
coding region and vector sequence. Vectors were gapped, co-transformed into 
SF838-1Do yeast with PCR fragments and cells were selected for Nat’. A second 
round of in vivo ligation was used to place the ancestral genes under 500 bp of the 
VMA3 or VMA16 promoters to create pGF140 and pGF139, respectively. For 
vectors pGF240, pGF241, 1lpGF252, pGF253, pGF503-pGF508, pGF510, 
pGF512-pGF515, pGF517-pGF519, pGF521, pGF523, pGF528, pGF529, 
pGF531, pGF534—pGF537 and pGF542, the relevant locus (Anc.3-11, Anc.16 or 
Anc.3) was PCR amplified with 5’ and 3’ untranslated flanking sequence and 
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cloned into pCR4Blunt-TOPO (Invitrogen). When necessary, a modified 
Quikchange protocol” was used to introduce point mutations before the gene 
was subcloned into a yeast vector (pRS316 or pRS415). To generate pGF502, 
sequence from codon 31 to the stop codon of Anc.16 was amplified with the 
ADH::Nat'’ cassette from pGF139, cloned into TOPO, and in vivo ligated down- 
stream of the VMA16 promoter (including a start codon) in pRS415. 

A triple-fragment in vivo ligation was used to generate pGF646—pGF651. 
Gapped vector containing the VMA16 promoter was transformed into yeast with 
two PCR fragments of the ring genes to be fused. For pGF646, the coding region of 
(1) VMA16 (without codons 2-41) and (2) the coding region of Anc.11 (without 
codons 2-5 were amplified by PCR. The proteolipid on the C-terminal portion of 
the gene fusion also contained the ADH terminator and Nat’ cassette; the amp- 
lified products contained PCR tails with homology to link the genes to both the 
gapped vector and to each other. Gene fusions were modelled after the experi- 
mental design of Wang et al. (2007)** in which the lumenal protein sequence 
linking the two proteolipids was designed to be exactly 14 amino acids. To meet 
these criteria, additional amino acids were inserted into the following vectors 
linking the two subunits: pGF646 (Thr-Arg-Val-Asp), pGF648, pGF650 (Thr- 
Arg), pGF649, pGF651 (Gly-Ser). 

Yeast strains that were used are listed in Supplementary Information, section 2. 
Strains containing deletion cassettes other than Kan* *° were constructed by PCR 
amplifying the H: yg" or Nat’ cassette from pAG32 or pAG25, respectively, with 
primer tails with homology to flanking sequences to the VMAI1 or VMA16 loci. 
114::Kan® and 164::Kan* strains (SF838-1De) were transformed with the Hyg® 
and Nat® PCR fragments, respectively, and selected for drug resistance. The 
114::Hyg® locus was amplified and transformed into LGY113 (to create 
LGY125) and LGY115 (to create LGY124). This was repeated with the 
16A::Nat® locus to create LGY139 and LGY143. 

Yeast Growth Assays. Yeast were grown in liquid culture, diluted fivefold and 
spotted onto YEPD media buffered to pH 5.0 or yeast extract peptone dextrose 
media containing 25 mM (Figs 2, 3, 4) or 30 mM CaCl, (Fig. 2f). 

Whole-cell extract preparation and immunoblotting. Yeast extracts and west- 
ern blots were performed as previously described*'. Antibodies that were used in 
this study included monoclonal primary anti-HA (Sigma-Aldrich), anti-Dpm1 
(5C5; Invitrogen) and secondary horseradish-conjugated anti-mouse antibody 
(Jackson ImmunoResearch Laboratory, West Grove, Pennsylvania, USA). 
Fluorescence microscopy. Staining with quinacrine was performed as previously 
described*'. The cell wall (shown in red) was visualized using concanavalin A 
tetramethylrhodamine (Invitrogen). Microscopy images were obtained using an 
Axioplan 2 fluorescence microscope (Carl Zeiss). A X 100 objective, AxioVision 
software (Carl Zeiss) and Adobe Photoshop Creative Suite (v. 8.0) were used. 
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Human immunodeficiency virus (HIV) has a small genome and 
therefore relies heavily on the host cellular machinery to replicate. 
Identifying which host proteins and complexes come into physical 
contact with the viral proteins is crucial for a comprehensive 
understanding of how HIV rewires the host’s cellular machinery 
during the course of infection. Here we report the use of affinity 
tagging and purification mass spectrometry'* to determine 
systematically the physical interactions of all 18 HIV-1 proteins 
and polyproteins with host proteins in two different human cell 
lines (HEK293 and Jurkat). Using a quantitative scoring system 
that we call MiST, we identified with high confidence 497 HIV- 
human protein-protein interactions involving 435 individual 
human proteins, with ~40% of the interactions being identified 
in both cell types. We found that the host proteins hijacked by HIV, 
especially those found interacting in both cell types, are highly 
conserved across primates. We uncovered a number of host com- 
plexes targeted by viral proteins, including the finding that HIV 
protease cleaves eIF3d, a subunit of eukaryotic translation ini- 
tiation factor 3. This host protein is one of eleven identified in this 
analysis that act to inhibit HIV replication. This data set facilitates 
a more comprehensive and detailed understanding of how the host 
machinery is manipulated during the course of HIV infection. 

A map of the physical interactions between proteins within a par- 
ticular system is necessary for studying the molecular mechanisms that 
underlie the system. The analysis of protein-protein interactions 
(PPIs) has been successfully accomplished in different organisms using 
a variety of technologies, including mass spectrometry approaches’** 
and those designed to detect pairwise physical interactions, including 
the two-hybrid yeast system** and protein-fragment complementa- 
tion assays’. Although two-hybrid methodologies have been used to 
systematically study host-pathogen interactions*”, so far no systematic 
affinity tagging/purification mass spectrometry (AP-MS) study has 
been carried out on any host-pathogen system. Here we have targeted 
HIV-1 for such an analysis, uncovering a wide variety of host proteins, 
complexes and pathways that are hijacked by the virus during the 
course of infection. 

We aimed to identify host proteins associated with HIV-1 proteins 
systematically and quantitatively using an AP-MS approach”’. To this 
end, we cloned the genes corresponding to all 18 HIV-1 proteins and 
polyproteins, including the accessory factors (Vif, Vpu, Vpr and Nef), 
Tat, Rev, the polyproteins (Gag, Pol and Gp160) and the corresponding 
processed products (MA, CA, NC and pé6; PR, RT and IN; and Gp120 
and Gp4l, respectively) (Supplementary Fig. 1 and Supplementary 
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Table 1). Each clone was fused to a purification tag (consisting of 
2Strep and 3X Flag) and transiently transfected into HEK293 cells; 
each also was used to generate stably expressed, tetracycline-inducible, 
affinity-tagged versions of the proteins in Jurkat cells (Fig. la and 
Supplementary Fig. 2). Following multiple purifications of each factor 
from both cell lines, the material on the anti-FLAG or Strep-Tactin 
beads, as well as the eluted material, was analysed by mass spectrometry 
(Fig. la and Supplementary Table 2). Finally, an aliquot of each purified 
factor was subjected to SDS-polyacrylamide gel electrophoresis, stained 
(Supplementary Fig. 3) and subjected to analysis by mass spectrometry. 

For each HIV factor, we identified co-purifying host proteins that 
were reproducible regardless of the protocol used (Supplementary 
Figs 4, 5 and 7 and Supplementary Data 1). Several scoring systems 
can quantify PPIs from AP-MS proteomic data sets, including PE’, 
CompPASS* and SAINT"’. For this data set, we devised a scoring 
system particularly suited for identifying AP-MS-derived host-pathogen 
PPIs, which we call MiST (mass spectrometry interaction statistics). The 
MiST score is a weighted sum of three measures: protein abundance 
measured by peak intensities from the mass spectrum (abundance); 
invariability of abundance over replicated experiments (reproducibility); 
and uniqueness of an observed host-pathogen interaction across all viral 
purifications (specificity) (Fig. 1b and Supplementary Methods). These 
three metrics are summed by principal component analysis into a 
composite score (Fig. 1c and Supplementary Data 2). By comparing 
our dataset with a benchmark of well-characterized HIV-human PPIs 
(Supplementary Table 3), analysis of the MiST scoring system revealed 
superior performance on our data set when compared to CompPASS 
or SAINT (Supplementary Fig. 6) (and comparable performance using 
other data sets (Supplementary Fig. 8)) and allowed us to define a MiST 
cut-off of 0.75, corresponding to ~4% of all detected interactions. To 
estimate how many interactions would exceed this threshold by chance, 
we randomly shuffled our data set 1,000 times. A random MiST score of 
0.75 or greater was assigned to an interaction ten times less frequently 
than we saw among the MiST scores for the real data, and the 
probability of an interaction assignment with a random MiST score 
greater than 0.75 was 2.5 X 10 * (Fig. 1d). 

At the MiST threshold of 0.75, the number of host proteins we found 
associated with each HIV protein ranged from 0 (CA and p6) to 63 
(Gp160) (Fig. le). In total, we observed 497 different HIV-human 
PPIs (347 and 348 identified from HEK293 cells and Jurkat cells, 
respectively) (Supplementary Data 3). We detected 196 interactions 
(~40%) in both cell types; 150 and 151 were specific to the HEK293 
cells and the Jurkat cells, respectively (Fig. le). Only some of these 
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Figure 1 | Affinity purification of HIV-1 proteins, analysis and scoring of 
mass spectrometry data. a, Flowchart of the proteomic AP-MS used to define 
the HIV-host interactome. PAGE, polyacrylamide gel electrophoresis. SF, 

2 Strep-3 Flag affinity tag. b, Data from AP-MS experiments are organized 
in an interaction table with cells representing amount of prey protein purified 
(for example spectral counts or peptide intensities). Three features are used to 
describe bait-prey relationships: abundance (blue), reproducibility (the 
invariability of bait—prey pair quantities; red) and specificity (green). c, All bait- 
prey pairs are mapped into the three-feature space (abundance, reproducibility 
and specificity). The MiST score is defined as a projection on the first principal 
component (red line). All interactions, represented as nodes, above the defined 
threshold (0.75) are shown in red. This procedure separates the interactions 
more likely to be biologically relevant (for example Vif-ELOC (ELOC also 
known as TCEB1), Vpr-VPRBP and Tat-CCNT1) from the interactions that 
are likely to be less relevant owing to low reproducibility (Vpu-ATP4A) or 


specificities could be explained by differential gene expression in the 
two cell lines (Supplementary Fig. 9). Using antibodies against 26 of 
the human proteins, and affinity-tagged versions of an additional 101, 
we could confirm 97 of the 127 AP-MS derived HIV-human PPIs 
using co-immunoprecipitation/western blot analysis (76% success 
rate) (Supplementary Figs 10 and 11), suggesting that we derived a 
high-quality physical interaction data set. 

We next analysed the functional categories of host proteins associated 
with each HIV protein, and in doing so uncovered many expected 
connections. These included an enrichment of host factors involved 
in transcription physically linked to the HIV transcription factor Tat 
and an enrichment of host machinery implicated in the regulation of 
ubiquitination associating with Vpu, Vpr and Vif, HIV accessory 
factors that hijack ubiquitin ligases'* (Fig. 1f and Supplementary 
Data 4). When we considered domain types instead of whole proteins 
(Fig. 1g and Supplementary Table 4), we found that host proteins 
interacting with IN are enriched in 14-3-3 domains, which generally 
bind phosphorylated regions of proteins’’, and that proteins contain- 
ing B-propellers have a higher propensity for binding to Vpr (for 
additional domain enrichment analysis, see Supplementary Fig. 12). 
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specificity (RT-HSP71 (HSP71 also known as HSPA1A) and NC-RL23A 
(RL23A also known as RPL23A)).d, The histogram of MiST scores (real data) is 
compared with a randomized set of scores obtained from randomly shuffling 
the bait-prey table (simulated data). The MiST score threshold (0.75) was 
defined using a benchmark (Supplementary Table 3) whereby the predictions 
are enriched for these interactions by a factor of at least ten relative to random 
predictions (as well as through ROC (receiver operating characteristic) and 
recall plots (Supplementary Fig. 6)). e, Bar graph of the number of host proteins 
we found interacting with each HIV factor (MiST score, >0.75). The cell type in 
which the interaction was found is represented in blue (HEK293 only), yellow 
(Jurkat only) or red (both). f, g, Heat maps representing enriched biological 
functions (f) and domains (g) from the human proteins identified as interacting 
with HIV proteins (Supplementary Methods). ER, endoplasmic reticulum; 
mRNA, messenger RNA; tRNA, transfer RNA. TPR, tetratricopeptide repeat; 
HTH, helix-turn-helix; SPFH, stomatin-prohibitin—flotillin-HflK/C. 


These domain analyses could facilitate future structural modelling of 
HIV-human PPIs. 

Next we compared our data to other HIV-related data sets, includ- 
ing previously published HIV-human PPIs and host factors impli- 
cated in HIV function from genome-wide RNA interference (RNAi) 
screens. For example, the VirusMint database’* contains 587 HIV- 
human literature-curated PPIs (Supplementary Data 5), which are 
mostly derived from small-scale, targeted studies. Although the over- 
lap between the 497 interactions identified in this work and those in 
VirusMint is statistically significant (P = 8 X 10°), it corresponds to 
only 19 PPIs (Fig. 2a and Supplementary Table 5). However, a greater 
overlap exists, one that remains statistically significant, when inter- 
actions below the MiST threshold of 0.75 are considered using a sliding 
cut-off (for example, at a MiST score of 0.2 there exists an overlap of 67 
PPIs (P=1X 10 %); Fig. 2c, red lines, and Supplementary Data 6). 
This overlap indicates that we have indeed identified many inter- 
actions that have been previously reported. However, it is likely that 
the higher scoring interactions identified here have a greater chance of 
being biologically relevant with respect to HIV function than do many 
of those in VirusMint. 
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Figure 2 | Comparison of PPI data with other HIV data sets. a, Overlap of 
the 497 HIV-human PPIs with the 587 PPIs reported in VirusMint 
(Supplementary Table 5). b, Overlap of the 435 human proteins with the genes 
identified in four HIV-dependency RNAi screens’*""* (Supplementary Table 6). 
c, Number of interactions overlapping with VirusMint (solid red line) and 
proteins with RNAi screens (solid blue line) as functions of the MiST cut-off. 
The P values of the overlap are represented as dashed lines using the same 
colours (Supplementary Data 6 and 8). d, Comparative genomics analysis of 
divergence patterns between human and rhesus macaque reveals strong 
evolutionary constraints on human proteins binding to HIV proteins. The x 
and y axes represent P values for the synonymous (dS) and non-synonymous 
(dN) rates of evolution (Supplementary Methods). Horizontal and vertical 
dotted lines are drawn at 0.5% to indicate the Bonferroni significance threshold 
for each axis. For the VirusMint data, the significance of w (dN/dS) is primarily 
driven by higher rates of synonymous evolution. U, union; (-), intersection; P,,, 
bootstrap-based P value for «. 


Recently, four RNAi screens identified host factors that have an 
adverse effect on HIV-1 replication when knocked down'*"*. In total, 
1,071 human genes were identified in these four studies (Supplementary 
Data 7), 55 of which overlap with the 435 proteins (P = 2.7 X 10 1°; 
Fig. 2b, Supplementary Fig. 12 and Supplementary Table 6). Again, this 
overlap increases (as does its statistical significance) if we consider 
proteins participating in HIV-human PPIs with MiST scores below 
0.75 (Fig. 2c, blue lines, and Supplementary Data 8). 

To identify the evolutionary forces operating on host proteins inter- 
acting with HIV-1, we performed a comparative genomics analysis of 
divergence patterns between human and rhesus macaque. The proteins 
identified in both HEK293 and Jurkat cell lines had stronger signatures 
of evolutionary constraint than those identified exclusively in one cell 
line or in VirusMint (Fig. 2d). Points in the lower-right quadrant of 
Fig. 2d show signatures of strong purifying selection, whereas the 
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upper-right quadrant shows signatures more consistent with neutral 
evolution. This observation suggests that the PPIs identified in our study, 
especially the ones identified in both cell types, are more physiologically 
relevant to mammalian evolution than those reported in VirusMint. 
We next plotted the 497 HIV-human interactions identified in this 
study in a network representation (Fig. 3) containing nodes corres- 
ponding to 16 HIV (yellow) and 435 human factors that were derived 
from the HEK293 cells (blue), Jurkat cells (red) or both. We also 
introduced 289 interactions between human proteins (black edges) 
derived from several databases” (Supplementary Data 9). These 
human-human interactions helped to identify many host complexes, 
including several that have been previously characterized (see 
Supplementary Information for a detailed discussion of the HIV- 
human interaction data sets). Ultimately, all data will be accessible 
for searching and comparison to other HIV-related data sets using 
the web-based software GPS-PROT”’ (http://www.gpsprot.org/). 
Notably, we found that Pol and PR, which we needed to make 
catalytically inactive (Supplementary Fig. 1), bound the translational 
initiation complex eIF3, a 13-subunit complex (eIF3a to eIF3m). We 
detected 12 of the subunits bound to Pol and/or PR, except eIF3j, which 
is only loosely associated with the complex” (Fig. 4a). Even though PR 
is the smallest of the pol-encoded proteins, we find it associated with the 
greatest number of host factors (Fig. 4a). To determine whether com- 
ponents of the translation complex are substrates for PR, FLAG-tagged 
versions of ten eIF3 subunits were individually co-transfected, each 
with a small amount of active HIV-1 PR, into HEK293 cells. The cell 
lysates were analysed by western blotting and only eIF3d was found to 
be cleaved (Fig. 4b). Purification of tagged versions of the amino and 
carboxy termini of cleaved eIF3d revealed that only the N terminus of 
114 amino-acid residues associates with the eIF3 complex (Supplemen- 
tary Table 7). The cleavage occurred with an efficiency similar to that of 
the processing of the natural PR substrate Gag (Fig. 4c), whereas two 
cellular proteins previously described to be cleaved by HIV PR, 
PAPBC1” and BCL2™, were cleaved only at higher PR concentrations 
or not at all, respectively. To confirm this result in vitro, we incubated 
purified human eIF3 with active PR, resulting in the removal of a 
70-kDa band and the appearance of a ~60-kDa protein product 
(Fig. 4d). Analysis of the cleaved product by N-terminal sequencing 
revealed a cleavage of eIF3d between Met 114 and Leu 115, which 
corresponds to the consensus sequence for HIV-1 protease” and falls 
within the RNA-binding domain (RRM) of eIF3d (ref. 26; Fig. 4d). 
Next we used four to six short interfering RNAs against different eIF3 
subunits in HIV infectivity assays (Fig. 4e, f, Supplementary Fig. 14 and 
Supplementary Table 8). Using a fusion of HIV with vesicular stomatitis 
virus glycoprotein (VSV-G), which only allows for a single round of 
replication, knockdown of eIF3d, but not other eIF3 subunits, resulted 
in an increase in infectivity (Fig. 4e), suggesting that this factor acts in 
early stages of infection. In assays requiring multiple rounds of HIV 
infection, knockdown of eIF3d, elF3e and eIF3f enhanced HIV NL4.3 
infectivity by a factor of three to five, whereas inhibition of eIF3c, eIF3g 
and eIF3i had no promoting effect (Fig. 4f). Consistent with these 
results, a previous overexpression screen for factors that restrict 
HIV-1 replication identified eIF3f as the most potent inhibitory 
clone”’. Furthermore, using assays monitoring both early and late 
products we found that knockdown of eIF3d results in an increase in 
accumulation of reverse transcription product (Fig. 4g and Sup- 
plementary Fig. 15). This suggests that eIF3 does in fact have a role 
in the early stages of infection, perhaps by binding to the viral RNA 
through the RNA-binding domain in eIF3d, and thus inhibiting RT, an 
effect that is overcome by PR cleavage of elF3d (Supplementary Fig. 16). 
These results suggest that our data set will be enriched not only for 
host proteins the virus requires for efficient replication (Fig. 2b, c), 
but also those that have an inhibitory role during infection. Indeed, 
we have found that an additional ten factors from our list of inter- 
actors, when knocked down by RNAi, produce an increase in HIV 
infection (Supplementary Figs 17-19, Supplementary Tables 12 
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Figure 3 | Network representation of the HIV-human PPIs. In total, 497 
HIV-human interactions (blue) are represented between 16 HIV proteins and 
435 human factors. Each node representing a human protein is split into two 
colours and the intensity of each colour corresponds to the MiST score from 


and 13 and Supplementary Methods). Knockdown of two of these, 
DESP and HEATI, also resulted in an increase in HIV integration 
(Supplementary Fig. 20 and Supplementary Table 14), consistent with 
their physical association with IN. 

As well as performing the systematic AP-MS study reported here, 
we explored in further detail the biological significance of two newly 
identified HIV-human interactions: HIV protease targeting a com- 
ponent of eIF3 that is inhibitory to HIV replication; and CBF-f, a new 
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interactions derived from HEK293 (blue) or Jurkat (red) cells. Black edges 
correspond to interactions between host factors (289) that were obtained from 
publicly available databases; dashed edges correspond to interactions also 
found in VirusMint™. 


component of the Vif-CUL5 ubiquitin ligase complex required for 
APOBEC3G stability and HIV infectivity’. Further work will be 
required to determine whether, how and at what stage of infection 
the remaining host factors impinge on HIV function. Ultimately, 
our analysis of the host factors co-opted by different viruses using 
the same proteomic pipeline will allow for the identification of protein 
complexes routinely targeted by different pathogens, which may rep- 
resent better therapeutic targets for future studies. 
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Figure 4 | eIF3d is cleaved by HIV-1 PR and inhibits infection. a, MiST 
scores for eIF3 subunits associated with PR (right) and Pol (left) in HEK293 and 
Jurkat cells. Sizes of the proteins and numbers of significant interactions (MiST 
score, >0.75) detected for Pol and its subunits are shown below, as is a modular 
representation of the e[F3 complex”. The cleaved subunit, eIF3d, is in red. 

b, Western blot of HEK293 cell lysate expressing FLAG-tagged eIF3 subunits in 
the absence (—) or presence (+) of active PR probed with an anti-FLAG 
antibody. c, HEK293 cells were co-transfected with Gag, or FLAG-tagged 
eIF3d, PABPC1, BCL2 and increasing amounts of PR. Cell lysates were probed 
against Gag (upper panel), FLAG-tagged eIF3d (middle panel) or tubulin as 
control (lower panel). d, Silver stain of purified eIF3 complex incubated with 
recombinant HIV-1 PR. The residues corresponding to the elF3d cleavage site 


METHODS SUMMARY 


More details on experimental assays, plasmid constructs, sequences, cell lines, 
antibodies and computational analysis are provided in Supplementary Methods. 
Briefly, affinity tagging and purification was carried out as previously described? 
and the protein samples were analysed on a Thermo Scientific LTQ Orbitrap XL 
mass spectrometer. For the evolutionary analysis, genome-wide alignments to 
rhesus macaque were downloaded from the University of California, Santa Cruz 
genome browser (http://genome.ucsc.edu/) and evolutionary rates for each group 
of genes considered were measured using the synonymous and non-synonymous 
rates of evolution. For the in vitro protease assay, maltose binding protein (MBP)- 
tagged PR was expressed in BL21 (Gold) DE3 cells in the presence of 100 1M 
Saquinavir and purified on an MBP trap column. Purified eIF3 was obtained from 
J. Cate (University of California, Berkeley). For the infection assays, HeLa P4.R5 
cells were transfected with short interfering RNAs and after 48h infected with 
pNL4-3 or a pNL4-3-derived VSV-G-pseudotyped reporter virus. Infection levels 
were determined by luminescence read-out. 
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(red) is located within the RNA-binding domain”. e, f, HeLa-derived P4/R5 
MAGI cells were transfected with two different short interfering RNAs 
(siRNAs) targeting individual subunits of the eIF3 complex (Supplementary 
Tables 7 and 9) and subsequently infected with either a pNL4-3-derived, VSV- 
G-pseudotyped, single-cycle virus (HIV-VSV-G) (e) or wild-type pNL4-3 
(f). NC, negative control. g, Early (left) and late (right) HIV-1 DNA levels 
measured by quantitative PCR amplification in cells transfected with two 
independent eIF3d siRNAs or with control siRNAs. Samples were normalized 
by input DNA amount or by cellular gene (HMBS) copy number. The RT and 
replication assays were done three to five times and the standard deviations are 
shown (Supplementary Tables 7, 10 and 11). *P < 0.05 (Kruskal-Wallis test 
with Dunn’s correction for multiple comparisons). 
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Vif hijacks CBF-B to degrade APOBEC3G and 


promote HIV-1 infection 


Stefanie Jager’**, Dong Young Kim**, Judd F. Hultquist**, Keisuke Shindo*, Rebecca S. LaRue*, Eunju Kwon’, Ming Li*, 
Brett D. Anderson‘, Linda Yen*, David Stanley*, Cathal Mahon)*?, Joshua Kane!’, Kathy Franks-Skiba!?, Peter Cimermancic”®, 
Alma Burlingame’, Andrej Sali?*°, Charles S. Craik, Reuben S. Harris*, John D. Gross**° & Nevan J. Krogan’”"*’ 


Restriction factors, such as the retroviral complementary DNA 
deaminase APOBEC3G, are cellular proteins that dominantly block 
virus replication’ *. The AIDS virus, human immunodeficiency virus 
type 1 (HIV-1), produces the accessory factor Vif, which counteracts 
the host’s antiviral defence by hijacking a ubiquitin ligase complex, 
containing CUL5, ELOC, ELOB and a RING-box protein, and 
targeting APOBEC3G for degradation* '®. Here we reveal, using 
an affinity tag/purification mass spectrometry approach, that Vif 
additionally recruits the transcription cofactor CBF-B to this 
ubiquitin ligase complex. CBF-f, which normally functions in 
concert with RUNX DNA binding proteins, allows the reconstitu- 
tion of a recombinant six-protein assembly that elicits specific 
polyubiquitination activity with APOBEC3G, but not the related 
deaminase APOBEC3A. Using RNA knockdown and genetic com- 
plementation studies, we also demonstrate that CBF-f is required 
for Vif-mediated degradation of APOBEC3G and therefore for 
preserving HIV-1 infectivity. Finally, simian immunodeficiency 
virus (SIV) Vif also binds to and requires CBF-B to degrade rhesus 
macaque APOBECS3G, indicating functional conservation. Methods 
of disrupting the CBF-B- Vif interaction might enable HIV-1 restric- 
tion and provide a supplement to current antiviral therapies that 
primarily target viral proteins. 

Mammals have evolved cellular proteins termed restriction factors 
that function to prevent the spread of mobile genetic elements includ- 
ing retroviruses’ *. As a counter-defence, most retroviruses, including 
the human pathogen HIV-1, have developed mechanisms to prevent 
restriction, often through subversion of the host’s ubiquitin—proteasome 
system. In eukaryotic cells, 8.6-kDa ubiquitin moieties are added to a 
target protein by sequential action of one of two ubiquitin-activating 
enzymes (E1), which transfer ubiquitin to a pool of dozens of ubiquitin- 
conjugating enzymes (E2) that, in turn, collaborate with hundreds of 
ubiquitin ligases (E3) to catalyse transfer to specific substrates'’. If more 
than four ubiquitins are joined together through K48 linkages, the target 
protein is usually degraded by the 26S proteasome”. At least three HIV- 
1 proteins, Vif, Vpu and Vpr, hijack cullin-RING E3 ligases consisting of 
CUL5, CUL] and CUL4A to promote ubiquitination and degradation of 
APOBEC3 family members (for example, APOBEC3G, A3G), BST2/ 
tetherin and an unknown, putative restriction factor, respectively”. 
Understanding the composition of cullin-RING E3 ligase complexes 
and the underlying cellular signalling components may provide thera- 
peutic routes for treating a variety of human diseases, including infec- 
tion by HIV-1. 

HIV-1 Vif is recruited to CUL5 by virtue of its SOCS box, which 
contains an elongin C binding helix (the BC-box), a conserved HCCH 
Zn binding motif and a short Cullin Box*®. Although a structure of the 
BC-box peptide in complex with the heterodimer of Elongin B and C 


(ELOBC) has been reported”, the architecture of the full-length Vif in 
complex with host factors has remained elusive, in part because Vif 
complexes have poor solubility and activity. We therefore reasoned 
that Vif may bind an additional host factor and that such a factor may 
render it more tractable in vitro. 

We took an unbiased proteomic approach to identify host factors 
that bind all 18 HIV processed and polyproteins using an affinity tag/ 
purification mass spectrometry (AP-MS) approach’*">. To this end, 
2X Strep and 3 Flag was fused to the carboxy (C) terminus of these 
factors, including Vif. The tagged Vif construct was both transiently 
transfected into HEK293 cells and used to make a stable, tetracycline- 
inducible Vif-Strep-Flag Jurkat T cell line (Fig. 1a). Epitope-tagged 
Vif was purified from both cell types using antibodies specific to either 
Strep or Flag and aliquots of the co-purifying proteins were subjected 
to SDS-polyacrylamide gel electrophoresis (SDS-PAGE) (Fig. 1b). 
Materials from each step were analysed by mass spectrometry”. 

Using a new scoring system for data derived from AP-MS studies, 
termed Mass Spectrometry Interaction Statistics (MiST)*, we iden- 
tified 24 Vif-human protein-protein interactions with seven of them 
found in both cell types (Fig. 1c). Seventeen of these were verified 
independently by co-immunoprecipitation (Supplementary Fig. 1). 
Among these were the components of the E3 ubiquitin ligase complex, 
CULS5, ELOB and ELOC, known to interact with Vif and trigger A3G 
degradation* °*"”. Although the RING-box protein RBX1 was originally 
reported as part of this complex*, only RBX2 was above the MiST score 
threshold used’ consistent with recent work showing that it binds 
CULS (refs 16, 17). We did not find endogenous A3G, probably because 
of its poor expression in HEK293 and Jurkat cell lines exacerbated by 
further depletion through Vif-mediated degradation. We did find Vif 
associating with two proteins that function in autophagy, AMRA1 and 
SQSTM, as well as with the transcriptional co-repressor complex 
NCOR1/HDAC3/GPS2/TBLIR (the last only in T cells) (Fig. Ic). 
Also, in both cell types, Vif was found to interact with the transcription 
cofactor CBF-B, which is known to heterodimerize with the RUNX 
family of transcription factors'®. 

To determine if any of the newly defined Vif interactors belong to 
the Vif-CUL5 complex, we performed double affinity purifications 
using cells co-transfected with Vif-2Strep and either A3G- or 
CUL5-3 Flag (Fig. 1d, e). After purification first with Strep-Tactin 
and second with anti-Flag beads, mass spectrometry analysis of the 
final elution revealed the presence of CUL5, ELOB, ELOC, RBX2 and 
invariably CBF-B, strongly suggesting that this last protein may be a 
new component of the Vif E3 ubiquitin ligase complex (other factors 
from single Vif purifications depicted in Fig. 1c were not present). To 
confirm this interaction and the composition of the complex, we per- 
formed an additional double affinity purification experiment using 
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Figure 1 | AP-MS experiments identify CBF-f as a Vif-dependent 
component of the Vif-CUL5 ubiquitin ligase complex. a, Flow-chart of the 
proteomic analysis performed during the study. b, Affinity-tagged versions of 
Vif, Vpu and Vpr were purified using 3x Flag from HEK293 and Jurkat cells, 
subjected to SDS-PAGE and stained with silver. Visible bands corresponding 
to interactions that are known for each accessory factor are labelled. Note Vif 
and CBF-f run at a similar place on the gel. Tagged versions of Vpr and Vpu 
were used as specificity controls. c, A network representation of Vif—host 
protein-protein interactions from both HEK293 (blue) and Jurkat T cells (red) 
after subjecting the data derived from the AP-MS analysis to the MiST scoring 
system’. The intensity of the node colours corresponds to the quantitative 
MisST score. Blue edges represent interactions derived during this work; black 
edges are previously described interactions between host factors; dashed edges 


Vif-+2 Strep and CBF-$-3Flag. This strategy also yielded CUL5, 
ELOB, ELOC and RBX2, in addition to the epitope-tagged bait 
proteins (Fig. 1d, e). 

To determine if the association of CBF-B with the CULS ligase 
complex was dependent on Vif, we immunoprecipitated CULS5- 
haemagglutinin (HA) or ELOB-HA in the presence or absence of Vif 
in HEK293 cells and blotted for endogenous CBF-. Only in the presence 
of Vif did CBF-B co-immunoprecipitate with tagged CUL5 or ELOB, 
indicating that recruitment of CBF-f to the CULS ligase is dependent on 
Vif (Fig. 1f). SIV Vif also associated with CBF-B by immunoprecipita- 
tion, suggesting the interaction is conserved (Fig. 1g). 

We next asked if the Vif-CUL5 ligase could be reconstituted with 
CBF-B using recombinant proteins purified from Escherichia coli. Initial 
purification attempts without CBF-B yielded aggregated and inactive 
complexes, assayed by size-exclusion chromatography and autoubiqui- 
tination activity, suggesting that CBF-B may be required for complex 
formation (data not shown). Therefore, full-length Vif, ELOB, ELOC 
and CBF-f were co-expressed, purified to homogeneity and found to 
form a stable, monodisperse complex with recombinant CUL5/RBX2, 
as shown by size-exclusion chromatography and SDS-PAGE analysis 
(Fig. 2a, b). Pull-down experiments performed with purified, His-tagged 
APOBEC3 enzymes immobilized on cobalt-chelating resin showed that 
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correspond to previously described Vif-host interactions present in the 
database VirusMint. d, The double purification approach, which allows for the 
identification of stable, stoichiometric protein complexes. e, Double 
purifications were performed in triplicate using 3 Flag-tagged CUL5, A3G or 
CBF-f with 2 Strep-tagged Vif in HEK293 cells. Proteins that were identified 
in all three double purifications, after trypsin digestion and analysis by mass 
spectrometry, are represented. The coverage corresponds to the percentage of 
protein identified by tryptic peptides. f, Immunoblots showing that Vif recruits 
CBE-f to the CUL5/ELOBC/RBX2 ubiquitin ligase complex. HA-tagged ELOB 
or CULS5 were immunoprecipitated in the absence or presence of increasing 
amounts of Vif, and endogenous CBF-B was monitored by immunoblot. g, HIV 
and SIV Vif co-immunoprecipitate CBF-$ and ELOC. GFP and HIV Nef were 
analysed in parallel as specificity controls. 


the four protein complex containing Vif, CBF-B and ELOBC binds 
A3G, but not the related Vif-resistant deaminase, A3A (Fig. 2c). 
These observations suggested that Vif, CBF-§ and ELOBC form a sub- 
strate adaptor for CUL5/RBX2 that enables specific interaction with 
susceptible A3 proteins. 

To test the activity of the reconstituted six protein complex, CUL5/ 
RBX2/ELOB/ELOC/Vif/CBF-B (CRL5-Vif-CBEF-B), we assayed sub- 
strate and Vif ubiquitination activities using two distinct and well char- 
acterized ubiquitin conjugating enzymes, UBE2R1 (hCDC34a) and 
UBCH5Sb, which are capable of forming specific K48 and heterogenous 
ubiquitin chain linkages, respectively’’””. With UBE2R1, CRL5-Vif- 
CBF-B catalysed formation of high-molecular mass K48 chains on 
A3G, but not A3A (Fig. 2d, e), mirroring the chain linkage and substrate 
specificity observed in cells****-**. As with most ubiquitin ligase assem- 
blies, the CRL5-Vif-CBF-f complex also possessed autoubiquitination 
activity that was only marginally affected by substrate A3s (Sup- 
plementary Fig. 2). These experiments were done with NEDD8- 
modified CULS, because NEDD8ylation is required for CULS to degrade 
A3G in vivo* (Supplementary Fig. 3). Similarly, with UBCH5b, CRL5- 
Vif-CBF-B was able to promote the specific polyubiquitination of A3G 
and elicit Vif autoubiquitination activity (Supplementary Fig. 4). We 
conclude that the reconstituted Vif E3 ligase is specific for A3G, 
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Figure 2 | CBF-f is a stoichiometric component of the Vif E3 ubiquitin 
ligase. a, Size exclusion chromatography of recombinant purified CUL5/RBX2 
(blue) overlaid with CUL5/RBX2 mixed with 1.5 equivalents of purified Vif 
substrate adaptor containing Vif, ELOBC and CBF-f (red). b, Coomassie- 
stained SDS-PAGE of fractions labelled 1-3 in a indicating the Vif substrate 
adaptor and a six-protein assembly (CRL5-Vif-CBF-f) co-purify as stable 
monodisperse species. ¢, A3G, but not A3A, directly binds the tetrameric Vif 
substrate adaptor in pull-down experiments in vitro. d, CRL5-Vif-CBF- is an 
E3 ligase that promotes polyubiquitination of A3G, but not A3A (detected 


supports K48 chain formation and can function with at least two 
ubiquitin conjugating enzymes in vitro. It is conceivable that these 
two ubiquitin-conjugating enzymes work together in cells to promote 
multi-monoubiquitination of A3G followed by specific chain elonga- 
tion, as described for other RING E3s**”*, but additional work will be 
necessary to rule out other E2s in vivo. 
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using an anti-c-Myc antibody to the C-terminal tag on the deaminases). Ub, 
ubiquitin. e, CRL5-Vif-CBF-B and UBE2R1 catalyse formation of K48-linked 
chains on A3G. Immunoblots showing substrate in ubiquitination reactions 
containing UBE2R1 as E2, no ubiquitin, Me-ubiquitin, K48R-ubiquitin, K48- 
only ubiquitin or wild-type ubiquitin. Reactions with Me-ubiquitin indicate at 
least two distinct sites are modified on A3G; K48R recapitulates the pattern 
observed with Me-ubiquitin, whereas both wild type and K48R-only ubiquitin 
result in extensive polyubiquitin chains. 


To determine if CBF-f is required for Vif folding and/or stability in 
living cells, we transfected a constant amount of Vif into HEK293T 
cells expressing either a scrambled short hairpin (sh)RNA or a CBF-f- 
specific shRNA. The levels of steady-state Vif were threefold lower in 
CBF-B-depleted cells than in the scrambled control cells (Fig. 3a). 
Proteasome inhibitor MG132 reversed this effect, suggesting that Vif 
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Figure 3 | CBF-f and Vif collaborate to degrade APOBEC3G and enable 
HIV-1 infectivity. a, CBF-B-depleted HEK293T cells have lower steady-state 
Vif levels, which recover upon treatment with 2.5 uM MG132. b, Infectivity of 
replication-competent, Vif-proficient HIV-1 in the presence and absence of 
CBF-B and A3G (n = 3; mean, s.d.). Immunoblots are shown for the indicated 
proteins in virus-producing cells and viral particles. c, Infectivity of HIV-GFP 
produced using HEK293T-shCBF-f or HEK293T-shControl clones 
transfected with the single-cycle virus cocktail, A3G, Vif and CBF-B as 
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indicated (n = 3; mean, s.d.). The corresponding immunoblots are shown 
below. d, Infectivity of a Vif-deficient HIV-1 molecular clone produced in the 
presence or absence of human or rhesus A3G-HA, HIV or SIV Vif-Myc, and 
CBF-f as indicated (nm = 3; mean, s.d.). Immunoblots are shown for the 
indicated proteins in virus-producing cells and viral particles with two 
exposures of the anti-Myc (Vif) blot shown to clarify the SIV Vif signal (the 
longer exposure also shows endogenous c-Myc). 
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Figure 4 | Model for Vif-CBF-f E3 ligase formation and APOBEC3G 
polyubiquitination and degradation. Vif is depicted hijacking cellular CBF-B 
to the E3 ubiquitin ligase complex required for A3G polyubiquitination and 
degradation. Vif may recruit newly translated CBF-B (not shown) and/or hijack 
existing CBF-B from RUNX transcription complexes. 


degradation is accelerated without CBF-B. Analogous data were 
obtained when Vif was expressed from a proviral plasmid in CBF-B- 
depleted cells and complemented with a CBF-8 expression plasmid 
(Supplementary Fig. 5). 

Based on these observations, we predicted that CBF-$ knockdown 
should result in less functional Vif and less infectious HIV-1 particles 
when produced in the presence of A3G. To test this prediction, shRNA 
was used to deplete CBF-f stably in HEK293T cells, and a knockdown 
clone was used to produce replication competent Vif-proficient HIV-1 
in the presence or absence of A3G and CBF-f expressed from plasmids 
(Fig. 3b). In CBF- depleted cells, steady-state Vif levels were very low 
despite equivalent levels of virus production as indicated by capsid. 
Moreover, Vif levels increased when CBF-B was replenished by com- 
plementation, and this correlated with decreases in cellular and viral 
A3G levels and corresponding increases in viral infectivity. In the 
absence of A3G, no difference in infectivity was observed regardless 
of cellular CBF-B or Vif levels. Titration experiments showed that 
CBF-B complementation is dose-responsive (Supplementary Fig. 6). 
Analogous results were obtained with a multi-vector HIV/green 
fluorescent protein (GFP) system (Fig. 3c and Supplementary 
Fig. 7). The Vif/CBF-f interaction was confirmed in virus-producing 
cells by co-immunoprecipitation experiments (Supplementary Fig. 8). 
Furthermore, SIVmac239 Vif requires CBF-B to degrade rhesus 
macaque A3G and promote viral infectivity (Fig. 3d). Interestingly, in 
contrast to HIV Vif, lower steady-state levels of SIV Vif were observed 
in the presence of CBF-f, which may be functionally significant or may 
be a consequence of the heterologous assay system (that is, expressing 
SIV/thesus proteins in human cells). Nevertheless, these results demon- 
strate the essential and conserved nature of CBF-8 for Vif function in 
promoting A3G degradation and efficient virus replication. 

Our proteomic, biochemical and genetic studies combine to suggest a 
model in which HIV-1 Vif hijacks the cellular transcription factor CBF- 
B to facilitate Vif folding and/or stability as well as nucleation of the rest 
of the E3 ubiquitin ligase complex (Fig. 4). CBF-B is required for A3G 
substrate binding and, ultimately, for polyubiquitination and degrada- 
tion, thereby enabling the production of infectious viral particles. 
Because genetic studies have shown that Vifis also capable of degrading 
APOBEC3F and several other human APOBEC3 proteins”*** most of 
which are expressed in primary CD4+ T lymphocytes*®”’, it is quite 
likely that CBF-B is required for counteracting multiple endogenous 
APOBEC3s and thus for rendering T lymphocytes permissive for 
HIV-1 replication. We anticipate that the development of antiviral 
therapies that antagonize the CBF-B-Vif interaction will be more 
powerful than those that specifically target the A3G-Vif interaction, 
because they have the potential to unleash the simultaneous restriction 
potential of multiple APOBEC3s analogous to current combinatorial 
therapies. 
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METHODS SUMMARY 


Affinity tagging, purification’* and ubiquitination assays** were performed as 
described. Recombinant A3A and A3G were purified as Myc-His tagged proteins 
from HEK293 cells. HIV-1 infectivity studies used an HIV-1yy, proviral DNA 
construct (with or without Vif) or an HIV-GFP reporter plasmid set. Control 
(RHS4346) and CBF-B (RHS4430-99161432) shRNA constructs were obtained 
from Open Biosystems. A CBF-B complementary DNA matching NM_001755.2 
was cloned from the CEM T cell line by RT-PCR. Immunoblots used antibodies to 
A3G (National Institutes of Health (NIH) ARRRP 10201 courtesy of J. Lingappa), 
CBF-B (Santa Cruz Biotechnology), HA (HA.11; Covance), TUB (tubulin; 
Covance), c-Myc (Sigma), Vif (NIH ARRRP 2221 courtesy of D. Gabuzda) and 
p24/capsid (NIH ARRRP 3537 courtesy of B. Chesebro and K. Wehrly). Details are 
provided in the Supplementary Methods. 
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T-cell differentiation factor CBF-B regulates HIV-1 
Vif-mediated evasion of host restriction 


Wenyan Zhang'*, Juan Du**, Sean L. Evans”, Yunkai Yu* & Xiao-Fang Yu"? 


The human APOBEC3 cytidine deaminases are potent inhibitors 
of diverse retroviruses, including human immunodeficiency virus-1 
(HIV-1)'°. HIV-1 Vif forms an E3 ubiquitin ligase complex with 
cullin 5 (CUL5), elongin B and elongin C 7°, which promotes the 
polyubiquitination and degradation of APOBEC3 substrates”’°"*. 
Here we demonstrate in human T cells that core binding factor B 
(CBF-B) is a key regulator of the evasion of HIV-1 from the host 
defence mediated by APOBEC3. CBF-B, the non-DNA-binding 
subunit ofa heterodimeric transcription factor, regulates the folding 
and DNA-binding activity of partner RUNX family proteins, which 
have important roles in the development and differentiation of 
diverse cell types, including T lymphocytes’*"*. In our study, knock- 
down of endogenous CBF-B blocked Vif-induced APOBEC3G 
polyubiquitination and degradation. CBF-B was not required 
for the interaction between Vif and APOBEC3G, yet was essential 
for the assembly of the Vif-CUL5 E3-ubiquitin-ligase complex. 
CBF-B proved to be a unique regulator of primate lentiviral Vif 
and not a general component of the CUL5 E3 ubiquitin ligase. We 
show that Vif and CBF-B physically interact, and that the amino- 
terminal region of Vif is required for this interaction. Furthermore, 
interactions with Vif required regions in CBF-f that are not involved 
in RUNX protein binding’”"’. Considering the importance of the 
interaction between Vif and CBF-f, disrupting this interaction 
represents an attractive pharmacological intervention against 
HIV-1. 

To identify novel cellular proteins involved in HIV-1 Vif function, we 
infected human CD4" T cells (H9) with HIV-1 Vif-haemagglutinin 
(HA) or HIV-1 (control) virus’? and then characterized the interac- 
tion of cellular proteins with HIV-1 Vif-HA in the infected cells by 
co-immunoprecipitation analysis. HIV-1 Vif forms an E3 ubiquitin 
ligase with CULS5 and elongin B/elongin C (ELOB/C) to promote the 
polyubiquitination and degradation of various substrates, including 
APOBEC3 and Vif itself*’°. To limit the potential degradation of 
these co-factors, we treated the cells with the proteasome inhibitor 
MG132 before co-immunoprecipitation. HA-tagged Vif was immuno- 
precipitated with anti-HA antibody from HIV-1 Vif-HA-infected H9 
cell lysates but not HIV-1 (control)-infected H9 cell lysates (Fig. 1a). 
Immunoprecipitation of Vif-HA was confirmed by immunoblotting 
with a HA-specific antibody (Fig. 1b). Untagged Vif was not immuno- 
precipitated from control-infected H9 cells (Fig. 1b, lane 2). CULS 
(84kDa) was co-precipitated with Vif-HA (Fig. la, lane 1) and 
confirmed by immunoblotting with a CUL5-specific antibody (Fig. 1b, 
lane 1); ELOB/C was also co-precipitated with Vif-HA (Fig. 1b, lane 1). 
These proteins were not co-precipitated from control HIV-1-infected 
H9 cells (Fig. 1b, lane 2). 

A previously uncharacterized protein of ~22 kDa also co-precipitated 
with Vif-HA (Fig. la, lane 1). This protein was identified by mass spec- 
trometry analysis as CBF-B, a ubiquitously expressed host protein’*"®. 
The identification of the protein was confirmed by immunoblotting 
with a CBF-f-specific antibody (Fig. 1b, lane 1). We also detected an 


interaction between HIV-1 Vif-HA and CBF-f in transfected 293T 
(human embryonic kidney) cells (Fig. 1c, lane 4), indicating that the 
interaction between CBF-fB and HIV-1 Vif can occur in the absence of 
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Figure 1 | Characterization of the functional interaction between HIV-1 Vif 
and CBF-f. a, Co-immunoprecipitation of cellular proteins with Vif-HA. Cell 
lysates from H9 cells infected with HIV-1 Vif-HA (HXB2Neo-Vif-HA, lane 1) 
or with control HIV-1 (that is, HXB2Neo containing untagged Vif, lane 2) were 
immunoprecipitated (IP) with anti-HA antibody conjugated to agarose beads, 
followed by SDS-PAGE and silver staining. The identification of CBF-B co- 
precipitated with Vif-HA (lane 1) was achieved by mass spectrometry. 

b, Immunoblot of precipitated samples from HIV-1- or HIV-1 Vif-HA- 
infected H9 cells using antibodies against CUL5, CBF-B, ELOB, ELOC or HA 
(to detect Vif-HA). c, Co-immunoprecipitation of endogenous CBF-f with 
Vif-HA from transfected 293T-cell samples. d, Vif-mediated downregulation 
of APOBEC3G (A3G) requires CBF-f. 293T cells were co-transfected with 
expression vectors for APOBEC3G-HA plus NL4-3, NL4-3AVif, or a control 
vector (pcDNA3.1) in the presence of control siRNA (lanes 1-3) or siRNA 
against CBF-B (lanes 4-6). e, CBF-B is required for Vif-mediated exclusion of 
APOBEC3G from virions. e, f, Supernatants from transfected 293T cells as 
described in d were collected and analysed for APOBEC3G-HA virion 
packaging (e) and viral infectivity (f). f, Virus infectivity was assessed by 
multinuclear activation of a galactosidase indicator (MAGI) assay. Virus input 
was normalized by the amount of CAp24. Virus infectivity with NL4-3 (Vif- 
positive) produced from transfected 293T cells in the presence of control siRNA 
was set to 100%. Error bars in f and g represent the standard deviations from 
triplicate wells. g, Silencing CBF-B expression had no effect on viral infectivity 
in the absence of APOBEC3G. 
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Figure 2 | CBE-f is critical for the interaction between Vif and CULS5. 

a-d, 293T cells were transfected with the indicated expression vectors. Cell 
lysates were prepared and immunoprecipitated followed by immunoblot 
analysis as described in Fig. 1. a, Co- immunoprecipitation of CUL5, ELOB and 
ELOC with Vif-Myc but not CBF-B—Myc. b, Co-immunoprecipitation of 
endogenous CBF- with Vif-HA but not CUL5-HA. ¢, Silencing CBF-B 
inhibits the interaction between Vif and CULS. d, Silencing CUL5 does not 
affect the Vif-CBF-B interaction. 


other viral proteins. MG132 treatment increased the stability of HIV-1 
Vif but did not alter its interaction with CBF-B (Supplementary Fig. 1); 
RNase treatment also did not block its interaction with CBF-f. Using a 
live cell imaging approach, we observed co-localization of yellow fluor- 
escent protein (YFP)-Vif and cyan fluorescent protein (CFP)-—CBF-B 
in transfected 293T cells along with a positive fluorescence resonance 
energy transfer (FRET) signal, further validating a physical interaction 
(Supplementary Fig. 2). 

To determine whether CBF-f is required for the function of HIV-1 
Vif, we knocked down endogenous CBF-f expression by CBF-f- 
specific short interfering RNA (siRNA). As compared to control 
siRNA-treated cells (Fig. 1d, lanes 1-3), CBF-B was reduced by 
~70% in 293T cells treated with CBF--specific siRNA (lanes 4-6). 
In the control siRNA-treated cells, HIV-1 Vif efficiently reduced the 
expression of APOBEC3G (lane 3) when compared to HIV-1 NL4-3 
strain Vif-negative (NL4-3AVif) virus-expressing cells (lane 2). 
Silencing of endogenous CBF-f (lanes 4-6) impeded the ability of 
Vif to reduce the expression of APOBEC3G (lane 6). Because HIV-1 
Vif mainly reduces the expression of APOBEC3G by inducing its 
degradation, these data indicated that CBF-f is required for the 
HIV-1 Vif-mediated degradation of APOBEC3G (Fig. 1d, compare 
lanes 2 and 3 to lanes 5 and 6). Interestingly, expression of exogenous 
CFP-CBF-B enhanced both wild-type, untagged Vif (pcDNA-hVif)- 
and YFP-Vif-induced APOBEC3G degradation in 293T cells (Sup- 
plementary Fig. 2). 

To exert its antiviral activity, APOBEC3G must be packaged into 
HIV-1 virions so that it can then inhibit virus infectivity in new target 
cells; Vif suppresses this antiviral activity by limiting the packaging of 
APOBEC3G into virions. Indeed, we found that when Vif was present, 
APOBEC3G was efficiently excluded from released HIV-1 virions in 
control siRNA-treated cells (Fig. le, lane 3); when Vif was absent, 
APOBEC3G could be packaged into HIV-1 virions (Fig. le, lane 2). 
However, the ability of Vif to exclude APOBEC3G from released HIV- 
1 virions was markedly compromised when CBF-f expression was 
knocked down in virus-producing cells (Fig. le, compare lanes 3 
and 6). Reducing the expression of CBF-f specifically blocked the 
ability of HIV-1 Vif to suppress the antiviral activity of APOBEC3G 
(Fig. 1f). In contrast, silencing CBF-f expression had no effect on the 
viral infectivity of NL4-3 or NL4-3AVif when APOBEC3G was absent 
(Fig. 1g), indicating that CBF-B regulates HIV-1 infectivity only in the 
presence of APOBEC3G. 

CBF-f apparently does not interact directly with APOBEC3G and is 
not required for the Vif-A POBEC3G interaction (Supplementary Fig. 3). 
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Human APOBEC3F is another human cytidine deaminase that has 
potent anti-HIV-1 activity; however, APOBEC3F and APOBEC3G 
are recognized by HIV-1 Vif through distinct mechanisms”*’. We 
postulated that if CBF-f is not involved in Vif-substrate recognition, 
it most likely regulates Vif mediated suppression of APOBEC3F as well 
as APOBEC3G. Indeed, HIV-1 Vif-mediated degradation, virion 
exclusion and suppression of APOBEC3F also required CBF-B (Sup- 
plementary Fig. 4). Thus, CBF-f is apparently required for Vif activity 
at a critical step(s) other than substrate recognition. 

Because CBF-f does not affect the interaction between HIV-1 Vif and 
APOBEC3G, it is possible that CBF-f influences the ubiquitin ligase 
activity of Vif-CUL5 E3 and could in fact be a previously unrecognized 
component of the CUL5-ELOB/C E3 ubiquitin ligase. To determine 
whether CBF-f is an integral component of this complex, we examined 
the interaction of CBF-$ with CUL5 and ELOB/C. Co-immunoprecipi- 
tation of CUL5 and ELOB/C with HIV-1 Vif-Myc (Fig. 2a, lane 6) was 
readily detected, but there was no co-immunoprecipitation of CUL5 or 
ELOB/C with CBF-B—Myc under the same conditions (lane 5). We also 
saw no interaction between endogenous CBF-8 and CUL5-HA in a 
reciprocal co-immunoprecipitation analysis (Fig. 2b, lane 6), despite a 
readily detectable interaction between CBF-B and Vif-HA (lane 5). 
Therefore, CBF-f is unlikely to be an integral component of the CUL5- 
ELOB/C E3 complex under physiological conditions. 

To determine whether CBF-f might have a role in the assembly of the 
Vif-CULS E3 ubiquitin ligase, we examined the interaction of Vif-HA 
with endogenous CULS and ELOB/C by co-immunoprecipitation ana- 
lysis. Reducing CBF-f expression with specific siRNA had no effect on 
the intracellular expression of CUL5 or ELOB/C (Fig. 2c, compare lanes 1 
and 2 to lanes 3 and 4). CUL5 and ELOB/C were co-immunoprecipitated 
with HIV-1 Vif-HA by the anti-HA antibody in control siRNA-treated 
cells (lane 6), but not in control cells that did not express HIV-1 Vif-HA 
(lanes 5 and 7). Co-immunoprecipitation of CUL5 with HIV-1 Vif-HA 
was significantly inhibited when CBF-f was silenced (Fig. 2c, lane 8) but 
co-precipitation of ELOB/C with HIV-1 Vif-HA was not (lane 8). 
Although silencing of CBF-f inhibited the interaction of Vif with 
CULS (Fig. 2c), reciprocal silencing of CULS5 (Fig. 2d, lane 4) had no 
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Figure 3 | Identification of regions in CBF-f is important for HIV-1 Vif 
interaction. a, Diagrams of mutant CBF-B constructs. The numbers in 
parentheses indicate amino acids that were deleted in the mutant CBF-B 
constructs. b, c, Co-precipitation of full-length (wild-type; WT) CBF-B-Myc 
and CBF-B mutants with HIV-1 Vif—HA. Cell lysates and precipitated samples 
were analysed by immunoblotting with an anti-Myc or anti-HA antibody. 

d, The structures of CBF-B (yellow) complexed with RUNX1 (pink) have been 
well characterized'’’’. Loop 3 (red) and helix 4 (blue) of CBF-B, which are 
located away from the interfaces between CBF-f and RUNX1, are important 
for the Vif-CBF-B interaction. 
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Figure 4 | Vif-induced polyubiquitination of APOBEC3G requires CBF-B. 
a-c, 293T cells were transfected with an expression vector encoding 
APOBEC3G-HA (A3G-HA), Vif, Myc-tagged ubiquitin (Ub) or control 
vector as indicated. All experiments were performed in the presence of MG132. 
a, Cells were lysed and analysed by immunoblot analysis using an anti- Vif, anti- 
CBFE-B or anti-HA antibody. b, Cell lysates were immunoprecipitated with the 
anti-HA antibody and analysed by immunoblotting with an anti-Myc antibody 
to detect ubiquitinated APOBEC3G. Unmodified APOBEC3G-HA was 
detected with the anti-HA antibody. c, Proposed models for the role of CBF-B 
in HIV-1 Vif function. 


detectable effect on the CBF-B-Vif interaction (Fig. 2d, lane 8). Again, 
these data indicate that CBF-B is not a general co-factor of CUL5, and 
CBF-B uniquely regulates the interaction of HIV-1 Vif with CULS5. 

Wealso observed that the activity of the adenoviral E4orf6-containing 
CUL5-ELOB/C ubiquitin ligase was not affected by CBF-B silencing 
(Supplementary Fig. 5). In addition, cellular SOCS-box-containing 
proteins (that is, ASB6 and SOCS3) (Supplementary Fig. 5), which 
are known to assemble with the CUL5-ELOB/C E3 ubiquitin ligase, 
did not interact with CBF-B. Also, although primate lentiviral Vif 
interacted with CBF-B, Vif from bovine immunodeficiency virus 
(BIV) did not (Supplementary Fig. 6). BIV Vif-mediated bovine 
APOBEC3 depletion was not affected by CBF-f silencing. 

CBF-B is a ubiquitously expressed component of a heterodimeric 
transcription factor including RUNX family proteins’*"*. The interface 
between CBF-f and RUNX proteins is well defined’”"’. To explore the 
regions of CBF-8 and HIV-1 Vif that are critical for their interaction, 
we first examined the Vif-binding activity ofa series of in-frame CBF-B 
deletion mutants (Fig. 3a). The regions of CBF-f that were important 
for the interaction with HIV-1 Vif mapped to loop 3 and helix 4 
(Fig. 3b, c), regions that do not interact with RUNX proteins (Fig. 3d). 

Consistent with the idea that HIV-1 Vif does not compete with 
RUNX for binding to CBF-B, we observed that the interaction of 
RUNX1 with CBF-B was not affected by the presence of Vif (Sup- 
plementary Fig. 7); in fact, a possible co-precipitation of Vif-HA with 
RUNX1-Myc was detected (Supplementary Fig. 7). RUNX1-mediated 
transcriptional activity”* was also not significantly affected by Vif, as 
measured in a CBF promoter-driven luciferase reporter assay (Sup- 
plementary Fig. 7). We determined that the N-terminal regions of Vif, 
including amino acids W21 and W38, were important for CBF-B 
binding (Supplementary Fig. 8). 

To determine whether CBF-f can influence the HIV-1 Vif-induced 
polyubiquitination of APOBEC3G, we transfected 293T cells with 
various combinations of expression vectors for HIV-1 Vif-Myc, 
APOBEC3G-HA and Myc-tagged ubiquitin in the presence of control 
siRNA or siRNA against CBF-B (Fig. 4a). To inhibit APOBEC3G 
degradation, all transfected cells were treated with MG132. HA-tagged 
APOBEC3G was immunoprecipitated by the anti-HA antibody, 
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followed by the detection of ubiquitinated APOBEC3G-HA using the 
antibody against Myc-tagged ubiquitin. Polyubiquitinated APOBEC3G- 
HA was detected in cells expressing Vif (Fig. 4b, lane 4); however, the 
Vif-induced polyubiquitination of APOBEC3G was inhibited by 
siRNA against CBF-B (lane 8). 

Taken together, our results demonstrate that CBF-B is a key regu- 
lator of HIV-1 Vif function. We have further shown that CBF-B is 
critical for Vif-induced endogenous APOBEC3G degradation and 
HIV-1 replication in H9 CD4” T cells (Supplementary Fig. 9). CBF- 
B allosterically regulates the conformation and DNA-binding activity 
of RUNX proteins'””. It remains to be determined, however, whether 
CBF-B can induce conformational changes in HIV-1 Vif that allow 
CULS binding to occur. Alternatively, CBF-B may stabilize the inter- 
action between Vif and CULS5 in the E3 ubiquitin ligase complex 
(Fig. 4c). Vif-CBF-B binding is critical for HIV-1 infection but does 
not seem to interfere with normal CBF-B-RUNX interaction. Thus, 
the interface between Vifand CBF-f may constitute a promising target 
for new anti-HIV drug discovery. 


METHODS SUMMARY 

Live-cell imaging and FRET assay. 293T cells were transfected with YFP-Vif 
(2 kg) and CFP-CBF-f (0.25 pg) using Lipofectamine 2000 (Invitrogen) in 6-well 
coverslip glass-bottom cell culture dishes (MatTek) at 37°C with 5% CO. 
Eightfold excess YFP-Vif was used due to low expression of YFP-Vif compared 
to CFP-CBF-B (Supplementary Fig. 2d). YFP-CFP dimer (0.25 1g), co-transfec- 
tion of YFP (0.25 jig) plus CFP (0.25 jg), and CFP-CBF-f alone were used for a 
positive and negative FRET signal as imaging controls. Cells were visualized 24- 
36h after transfection using a Zeiss LSM510-Meta confocal imaging system 
equipped with four argon lasers (458, 477, 488 and 514 nm lines), two HeNe lasers 
(542 and 633 nm), and one diode laser (405 nm). YFP and CFP were excited using 
a 514nm and 458 nm laser, respectively. After excitation, channel mode images 
were collected using a long-pass filter for CFP emission (LP 420) or band-pass 
filter for YFP emission (BP 530-600). In addition, the channel mode detector gain 
was set so that neither CFP nor YFP images contained saturated pixels before 
bleaching. A total of ten images (five pre-bleach and five post-bleach) were 
acquired for each cell with a X63 objective with <2 zoom, and image analysis 
and manipulation was performed using Zeiss LSM imaging software. FRET ana- 
lysis was performed using the acceptor photo-bleaching technique: YFP or YFP- 
Vif was photo-bleached for approximately 5 min at 514nm (100% laser power), 
and the average donor signal was evaluated before and after acceptor photobleach- 
ing. As a control, cells were also imaged under similar conditions, except the 
514nm laser was set at 0% transmission to prevent acceptor photobleaching. 
Two experiments were performed for each bleached sample for a total of ten cells 
each. One experiment was performed for each unbleached sample for a total of 
four cells each. The FRET efficiency percentage (E£) was calculated using the FRET 
macro of Zeiss LSM software. The following equation was used: E = (Fp,/ 
Fppost) * 100, where Fp, is the change in donor (CFP) fluorescence intensity 
before and after acceptor (YFP) bleaching and Fppost is donor fluorescence intensity 
after acceptor bleaching. The FRET macro of Zeiss LSM software also subtracts 
background when calculating E. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Plasmid construction. CBF-f coding sequences were amplified by reverse tran- 
scription and PCR using mRNA samples from H9 cells with the following primers: 
forward 5’-GCTAGCAAGATGCCGCGCGTCGTG-3’, reverse 5’-AAGCTTAC 
TACAGATCTTCTTCTGATATGAGTTTTTGTTCGGGTCTTGTTGTCTTCT 
TGCC-3’, containing Nhel and HindIII sites. The PCR product was cloned into 
pcDNA3.1 to generate pCBF-B-Myc. pCBF-BAN1, CBF-BAN2, CBF-BAN3, CBF- 
BAN4, CBF-BANS and CBF-BAN6 were made from pCBF-f-Myc by site-directed 
mutagenesis and confirmed by DNA sequencing. The following expression vectors 
have been previously described””°”*°: VR1012, HIV-1 Vif expression vectors 
pVif-Myc and pVif-HA, pAPOBEC3G-HA, pAPOBEC3G-Myc, pUbiquitin- 
Myc, pCUL5-HA, pAPOBEC3F-V5, E4orf6-Myc, SOCS3-HA, human p53, 
SIVmacVif-HA, SIVagmVif-Myc, BIVVif-HA, HXB2Neo and HXB2NeoVif- 
HA. pVifAN-HA was constructed with the following primers: forward 
5'-GTCGACATGGACCCTGAACTAGC-3’, reverse 5'-GGAATTCCTACGC 
GTAATCTGGGACGTCGTAAGGGTAGTGTTCCATTCATTGTGTGGCT-3’, 
containing SalI and Not! sites, respectively, and a C-terminal HA tag. pNL4-3AVif 
and pcDNA-hVif were gifts from K. Strebel. pRUNX1-Myc and p(CBF)4TKLuc 
were gifts from A. Friedman. NL43-AE-EGFP was a gift from R. Silicano. Plasmids 
pVifW11A, W21A, W38A, DR14/15AA, K22E, Y40A, RH41/42AA, T74A, E76A, 
R77A and W79A were made from pVif-Myc by site-directed mutagenesis and 
confirmed by DNA sequencing. To generate YFP N-terminal epitope-tagged 
NL4-3 Vif, pcDNA-hVif was used to PCR amplify the Vif coding region and 
cloned into the NotI and Xbal sites of pcDNA3-YFP (D. T. Yue). To generate 
CFP N-terminal epitope-tagged CBF-f, the CBF- coding region was cloned into 
the NotI and Xbal sites of pcDNA3-CFP (D. T. Yue). The YFP-CFP dimer 
construct was gift from D. T. Yue and was generated by fusing CFP and YFP with 
a 21 amino acid linker, SRAQASNSAVDGTAGPGSIAT. The following CBF-B 
shRNA clones were obtained from Open Biosystems: TRCN0000016643, with 
mature sense sequence 5’-CGAGAGTATGTCGACTTAGAA-3’; TRCN0000016644, 
5'-CCGCGAGTGTGAGATTAAGTA-3’; TRCN0000016645, 5’-GAAGATAGA 
GACAGGTCTCAT-3’; TRCN0000016646, 5’-GCTGGCAGTAACTGGCAAG 
AA-3'; TRCN0000016647, 5'-TGAGATTAAGTACACGGGCT-3’. 

Antibodies and cell lines. The H9 human CD4* T-cell line was maintained in 
RPMI-1640 medium (Invitrogen) with 10% fetal bovine serum and penicillin/ 
streptomycin (R-10 medium). 293T and MAGI-CCRS cells (AIDS Research and 
Reference Reagents Program, catalogue no. 3522) were maintained in Dulbecco’s 
modified Eagle’s medium (DMEM; Invitrogen) with 10% fetal bovine serum and 
penicillin/streptomycin (D-10 medium) and passaged when confluent. The fol- 
lowing antibodies were used: anti-HA (Covance, MMS-101R-1000), anti-Myc 
(Upstate, 05-724), anti-V5 (Invitrogen, R960-25), anti-B-actin (Sigma, A3853), 
anti-histone H3 (Genscript, A01502), anti-human ribosomal P antigens 
(Immunovision, HP0-0100), anti-HA antibody-agarose conjugate (Roche, 
11815016001), anti-ELOB (Santa Cruz, sc-1558), anti-ELOC (BD Transduction 
Lab, SIII/P15, 610760), anti-GAPDH (Sigma, G8795), anti-CBF-§ (Abcam, 
ab11921), anti-CUL5 (Santa Cruz, sc-13014), anti-p53 (Oncogene Research 
Products, OP03) and anti-GFP (Genscript, A01704); anti-CAp24 (catalogue no. 
1513) and anti-Vif antibodies (catalogue no. 2221) were obtained from the AIDS 
Research and Reference Reagents Program. 

RNA interference. RNAi against CBF-f was carried out using a pool of two 
duplexed siRNAs (Dharmacon): duplex 1, sense, 5’-CCAGCAGGAGGAUGC 
AUUAUU; antisense, 5’-PUAAUGCAUCCUCCUGCUGGUU; duplex 2, sense, 
5'-GCAGGCAAGGUAUAUUUGAUJU; antisense, 5’-PUCAAAUAUACCUU 


GCCUGCUU. RNAi against CULS was carried out using a pool of four duplexed 
siRNAs (Dharmacon): duplex 1, sense, 5’-GACACGACGUCUUAUAUUAUU; 
antisense, 5’-PUAAUAUAAGACGUCGUGUCUU; duplex 2, sense, 5’-CGUCU 
AAUCUGUUAAAGAAUU; antisense, 5’-PUUCUUUAACAGAUUAGAC 
GUU; duplex 3, sense, 5’-GAUGAUACGGCUUUGCUAAUU; antisense, 
5'-PUUAGCAAAGCCGUAUCAUCUU; duplex 4, sense, 5’-GUUCAACUACG 
AAUACUAAUU; antisense, 5'-PUUAGUAUUCGUAGUUGAACUU. 293T 
cells were transfected with the CBF-B or CUL5 siRNA pool at a total final con- 
centration of 100nM using Lipofectamine 2000 (Invitrogen). The non-targeting 
siRNA no. 2 (Dharmacon) was used as a control. Protein expression was 
monitored by immunoblotting 2-3 days after transfection. 
Identification of Vif-binding proteins. Vif-containing complexes were purified 
from HIV-1 Vif-HA and control HIV-1 (HXB2Neo) infected H9 cells by immuno- 
precipitation and analysed by SDS-PAGE. Gels were fixed in 50% methanol/10% 
acetic acid for 10min, stained with mass spectrometry-compatible colloidal- 
Coomassie brilliant blue G-250 (Bio-Rad 1610406) staining solution (20% methanol, 
8% ammonium sulphate, 1.6% phosphoric acid, 0.08% Coomassie blue G-250) to 
detect protein bands, and de-stained with distilled water. Protein standards from 
Bio-Rad (catalogue no. 1610314) were used to estimate protein size. Protein bands of 
interest were cut out of the gel and rinsed twice with 50% methanol (HPLC grade). 
In-gel digestion was performed on protein bands cut out of colloidal-Coomassie 
blue-stained SDS-polyacrylamide gels using sequencing-grade modified trypsin 
(Promega). Extracted peptides were co-crystallized in 2,5-dihydroxybenozic 
acid (DHB) or «-cyano-4-hydroxycinnamic acid (CHCA) (10mg ml! in 50% 
acetonitrile/0.3% TFA) and analysed by matrix-assisted laser desorption ioniza- 
tion time-of-flight (MALDI-TOF) spectrometry on a Voyager DE STR (Applied 
Biosystems, http://www.appliedbiosystems.com) using Voyager Instrument 
Control Panel (v. 5.1) and Data Explorer (v. 4.0) software. Data were acquired 
in reflector mode, and masses were externally calibrated using a standard peptide 
mixture to <50p.p.m. error. Proteins were identified by searching the acquired 
monoisotopic masses against the NCBI non-redundant or SwissProt databases 
using the MS-Fit search engine of ProteinProspector (http://prospector.ucsf.edu). 
APOBEC3G ubiquitination assay. 293T cells were transfected with expression 
vectors encoding APOBEC3G-HA, HIV-1 Vif and ubiquitin-Myc, either indi- 
vidually or in combination. Cells were also transfected with control or CBF-p- 
specific siRNA. Transfected cells were treated with 10 1M MG132 (proteasome 
inhibitor) for 16h, beginning at 24h after transfection, and then lysed in lysis 
buffer (50 mM Tris, pH 7.5, with 150 mM NaCl, 1% Triton X-100, 1 mM EDTA, 
1014M MG132 and complete protease inhibitor cocktail tablets), followed by 
centrifugation at 10,000g for 30 min. Lysates were applied to anti-HA antibody- 
conjugated agarose beads (Roche) and incubated for 3 h at 4 °C. After incubation, 
the beads were washed six times with washing buffer (20 mM Tris, pH 7.5, with 
0.1 M NaCl, 0.1 mM EDTA and 0.05% Tween 20), then eluted with elution buffer 
(0.1 M glycine-HCl, pH 2.0) followed by SDS-PAGE and immunoblotting with 
anti-Myc and anti-HA antibodies. 

Transfection, virus purification, immunoblot analysis, immunoprecipitation 
and viral infectivity (MAGI) assays were performed as previously described”. 


29. Luo, K. et al. Primate lentiviral virion infectivity factors are substrate receptors that 
assemble with cullin 5-E3 ligase through a HCCH motif to suppress APOBEC3G. 
Proc. Nat! Acad. Sci. USA 102, 11444-11449 (2005). 

30. Luo, K. et a/. Adenovirus E4orf6 assembles with Cullin5-ElonginB-ElonginC E3 
ubiquitin ligase through an HIV/SIV Vif-like BC-box to regulate p53. FASEB J. 21, 
1742-1750 (2007). 
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Acetyl coenzyme A (AcCoA) is the central biosynthetic precursor 
for fatty-acid synthesis and protein acetylation. In the conventional 
view of mammalian cell metabolism, AcCoA is primarily generated 
from glucose-derived pyruvate through the citrate shuttle and ATP 
citrate lyase in the cytosol’*. However, proliferating cells that 
exhibit aerobic glycolysis and those exposed to hypoxia convert 
glucose to lactate at near-stoichiometric levels, directing glucose 
carbon away from the tricarboxylic acid cycle and fatty-acid 
synthesis*. Although glutamine is consumed at levels exceeding 
that required for nitrogen biosynthesis’, the regulation and use of 
glutamine metabolism in hypoxic cells is not well understood. 
Here we show that human cells use reductive metabolism of 
a-ketoglutarate to synthesize AcCoA for lipid synthesis. This 
isocitrate dehydrogenase-1 (IDH1)-dependent pathway is active in 
most cell lines under normal culture conditions, but cells grown 
under hypoxia rely almost exclusively on the reductive carboxyla- 
tion of glutamine-derived o-ketoglutarate for de novo lipogenesis. 
Furthermore, renal cell lines deficient in the von Hippel-Lindau 
tumour suppressor protein preferentially use reductive glutamine 
metabolism for lipid biosynthesis even at normal oxygen levels. 
These results identify a critical role for oxygen in regulating carbon 
use to produce AcCoA and support lipid synthesis in mammalian 
cells. 

Although hypoxic cells exhibit a shift towards glycolytic metabolism’, 
a functional electron transport chain and glutamine-derived carbon 
are required for proliferation of most transformed cells®. In line with 
these studies, we observed increased glutamine consumption when 
A549 cells were cultured at approximately 1% oxygen while glutamate 
secretion remained unchanged (Supplementary Fig. 1a), indicating 
that net glutamine consumption was elevated and suggesting that 
glutamine carbon is used for biosynthesis. Notably, glucose consump- 
tion and lactate secretion also increased in these experiments. Con- 
sistent with this observation, we found that proliferating cells under 
both normoxia and hypoxia incorporate glutamine-derived carbon 
into lipids (Supplementary Fig. 1b). Glutamine can contribute carbon 
to lipogenic AcCoA through two distinct pathways. Cells can oxida- 
tively metabolize glutamine-derived o-ketoglutarate (wKG) in the 
tricarboxylic acid (TCA) cycle and generate pyruvate from malate 
by glutaminolysis’. Alternatively, some tissues can reductively carbo- 
xylate «KG to generate citrate”®, and recent studies have indicated that 
the IDH reaction is highly reversible”"''. To determine which pathway 
cells use to incorporate glutamine carbon into lipids we used stable 
isotopic tracers'**. We first cultured several cancer cell lines with 
[1-'°C]glutamine under normoxia and quantified the isotopic label 
present in metabolite pools along this pathway (Fig. la, red carbon 
atoms). All cells tested with this tracer retained significant label 
from [1-°C] glutamine in citrate and metabolites downstream of the 


irreversible ATP citrate lyase reaction, indicating that the reductive 
flux contributes to the cytosolic AcCoA pool (Supplementary Fig. 2). 
Additional evidence for activity along this pathway was obtained with a 
uniformly 3C-labelled ([U-'°Cs]) glutamine tracer (Supplementary 
Fig. 3). 

To quantify the specific contributions of oxidative and reductive 
glutamine metabolism to fatty-acid synthesis, we cultured cells in the 
presence of tracers for several days and performed isotopomer spectral 
analysis (ISA; Supplementary Fig. 4)'°. Use of [5-'*C]glutamine spe- 
cifically allows estimation of the flux of glutamine to lipids through 
reductive carboxylation (Fig. la, blue carbon atoms). Virtually all 
cell lines cultured with this tracer generated labelled fatty acids, 
metabolizing glutamine reductively in the TCA cycle to supply 10- 
25% of their lipogenic AcCoA (Fig. 1b and Supplementary Fig. 5). 
Consistent with these data, we were able to detect 98 + 5c.p.m. per 10° 
cells in hexane extracts of A549 cells cultured with (5-'*C] glutamine. 
Next we directly compared the contribution of glutamine to fatty acids 
through the reductive flux as a fraction of the total, the latter determined 
by using [U-'°C;]glutamine. In all cell lines tested, including those 
derived from lung, mammary, colon and squamous cell carcinoma as 
well as melanoma, glioblastoma and leukaemia, [5-°Cs] glutamine 
labelled most of the AcCoA derived from glutamine (Fig. 1b), 
highlighting the general use of reductive carboxylation as the primary 
route through which glutamine, glutamate and «KG carbon are con- 
verted to lipids in cultured cells (Fig. 1b and Supplementary Figs 2, 3 
and 5). The glutaminolysis pathway is also an important means of 
glutamine catabolism and can be characterized by quantifying the 
contribution of glutamine carbon to lactate. Consistent with published 
reports’, glutamine-derived 13¢ label was also detected in lactate, 
and the amount of ‘C-labelled lactate produced was highest in 
glioblastoma-derived cells compared with other cells cultured with 
[U-°Cs] glutamine (Supplementary Fig. 6). 

Mammalian cells express three IDH enzymes encoded by separate 
genes: IDH1 (cytosolic, NADP” -dependent), IDH2 (mitochondrial, 
NADP* -dependent) and the multi-subunit enzyme IDH3 (mitochon- 
drial, NAD ‘-dependent). IDH3 is allosterically regulated and is 
thought to operate in the oxidative direction. The NADP* -dependent 
isozymes are capable of catalysing the reductive reaction; however, the 
specific enzyme responsible for this flux is not definitively known’®. As 
measurements of metabolite pools in subcellular compartments and 
labelling therein cannot yet be reliably obtained, we used RNA inter- 
ference to knockdown expression of IDH1 and IDH2 selectively in 
A549 cells. Using labelling from [1-'*C]glutamine as a readout, we 
measured a significant and robust decrease in reductive carboxylation 
when IDH1 messenger RNA (mRNA) was targeted using short hairpin 
RNAs (shRNAs) (Fig. 1c, d). These changes were consistent with results 
using [U-°Cs] glutamine (Supplementary Fig. 7) and reproduced 
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Figure 1 | Reductive carboxylation is the primary route of glutamine to 
lipids. a, Schematic of carbon atom (circles) transitions and tracers used to 
detect reductive glutamine metabolism. Isotopic label from [1-'*C] glutamine 
(red) is lost during oxidative conversion to succinate (Suc) but retained on 
citrate (Cit), oxaloacetate (Oac), aspartate (Asp), malate (Mal) and fumarate 
(Fum) in the reductive pathway (green arrows). [5-3C] glutamine (blue) 
transfers label to AcCoA through reductive metabolism only. Molecular 
symmetry is shown for oxidative metabolism. Glc, glucose; Pyr, pyruvate; ME, 
malic enzyme; PDH, pyruvate dehydrogenase; Glu, glutamate; Gln, glutamine. 


using several cell lines (Supplementary Fig. 8). Finally, we used '°C 
metabolic flux analysis'’'* to quantify intracellular fluxes using 
[U-°C;]glutamine in A549 cells. The fitted data suggested that reductive 
IDH flux significantly decreased when IDHI1 protein levels were 
decreased, and this change was the primary alteration observed in the 
network (Fig. le; see Supplementary Tables 1-5 for complete results 
and metabolic flux analysis model description). 

Our results suggest that IDH1 can convert NADPH, «KG and CO, to 
isocitrate and NADP” in the cytosol. Enzymatic analysis using recom- 
binant protein indicated that IDH1 is indeed capable of consuming 
NADPH and is responsive to physiological levels of CO. (Sup- 
plementary Fig. 9). Importantly, the proliferation rate of all cell lines 
with IDH1 knockdown was impaired (Fig. 1f and Supplementary 
Fig. 8), indicating that reductive metabolism of «KG in the cytosol 
may be necessary for robust growth. In contrast to our results with 
IDH1 shRNAs, we detected no significant change in reductive flux 
when targeting IDH2 mRNA in A549, MDA-MB-231 and HCT116 
cells (Supplementary Fig. 10). Although IDH2 may promote reductive 
carboxylation in some tissues or conditions, these results are consistent 
with the proposed role of IDH2 as an oxidative TCA cycle enzyme”. 

Intriguingly, we detected a significant increase in reductive carbox- 
ylation activity when culturing cells under hypoxia (Supplementary 
Fig. 11). As glucose is usually the primary carbon source for mam- 
malian tissues'*, we next compared the contributions of reductive 
glutamine metabolism and glucose oxidation to fatty-acid synthesis 
by culturing cells with either [5-'*C]glutamine or uniformly labelled 
[U-13Cg] glucose under normal tissue culture conditions or hypoxia. 
Strikingly, oxygen levels influenced fatty-acid labelling from both 


IDH (oxidative) IDH (reductive) 


Time (h) 


b, Contribution of [5-'°C] glutamine and [U-'°Cs]glutamine to lipogenic 
AcCoA in cell lines. c, IDH1 levels in A549 cells expressing IDH1-specific 
(IDH1a and IDH1b) or control shRNAs. d, Metabolite labelling from 

[1-8] glutamine from cells in c. e, IDH flux estimates from 13C metabolic flux 
analysis model in control or IDH1-knockdown A549 cells cultured with 
[U-'°Cs]glutamine. f, Cell proliferation of A549 cells expressing control or 
IDH1-shRNAs. Error bars, 95% confidence intervals (b, e) and s.e.m. (n = 3) 
(d, f). *P < 0.05. 


tracers (Fig. 2a, b). Cells preferentially used glucose carbon for palmitate 
synthesis under normoxic conditions; however, fatty acids produced 
under hypoxia were primarily synthesized from glutamine carbon 
through the reductive pathway. In fact, the reductive carboxylation of 
glutamine-derived «KG accounted for approximately 80% of the carbon 
used for de novo lipogenesis in A549 cells growing under hypoxia 
(Fig. 2c). Conversely, we detected a concomitant decrease in the contri- 
bution of [U-’*Cg]glucose to fatty-acid synthesis under this condition. 
Significant increases in the relative use of this pathway were observed in 
all cell lines tested, including non-transformed cells (Supplementary 
Fig. 12). In addition, T lymphocytes freshly isolated from a mouse 
spleen preferentially used reductive glutamine metabolism over glucose 
oxidation for fatty-acid synthesis when activated under hypoxia (Sup- 
plementary Fig. 13). Although proliferation rates and relative de novo 
lipogenesis were lower under hypoxia (Supplementary Fig. 14a, b), the 
net flux of reductive glutamine metabolism to palmitate synthesis was 
significantly increased in hypoxic cultures (Fig. 2d). Knockdown of 
IDH1 protein mitigated the use of reductive glutamine metabolism 
for lipogenesis under hypoxia (Supplementary Fig. 14c). Although most 
human cells require glutamine for nucleotide and hexosamine bio- 
synthesis, some cell lines can grow in the absence of exogenous sources 
of glutamine”. Remarkably, we found that hypoxia increases the 
dependence of such cells on glutamine, as evidenced by decreased pro- 
liferation in the absence of glutamine and increased reductive glutamine 
metabolism under hypoxia when glutamine is present (Fig. 2e and 
Supplementary Fig. 15). 

To gain insight into the mechanisms controlling this switch to 
reductive glutamine metabolism, we analysed changes in the labelling 
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Figure 2 | Hypoxia reprograms cells to rely on reductive glutamine 
metabolism for lipid synthesis. a, b, Labelling of palmitate extracts from A549 
cells cultured under normoxia or hypoxia with [5-'*C] glutamine (a) or 
[U-?C,] glucose (b). Similar results were observed in myristate, oleate and 
stearate pools (not shown). c, Relative contribution of glucose oxidation 
({U-°Cg] glucose) or glutamine reduction ((5-13C] glutamine) to lipogenic 
AcCoA in A549 cells under normoxia and hypoxia. d, Absolute fluxes of 
[U-'°Cg] glucose and [5-'°C] glutamine to palmitate in A549 cells. Error bars, 
95% confidence intervals from model (c, d); *P < 0.05. e, Huh7 cell 
proliferation after 4 days in the presence or absence of glutamine. Error bars, 
s.e.m. (1 = 3) (a, b, e). **P< 0.005 comparing glutamine-free cultures. 
*P < (0.001 comparing normoxia and hypoxia. 


and abundances of TCA cycle metabolites in cells grown under 
hypoxia. Using [U-'*C,]glucose, we observed a significant decrease 
in relative flux through the pyruvate dehydrogenase (PDH) complex 
(Fig. 3a and Supplementary Fig. 16a, b). In addition, the citrate pool 
became depleted, which would be expected to increase reductive 
carboxylation flux through mass action (Fig. 3b and Supplementary 
Fig. 16c). On the other hand, we detected increased amounts of 
isotopic label in TCA cycle intermediates when using labelled 
glutamine tracers under hypoxia (Fig. 3c and Supplementary Fig. 16d, 
e). Reductively metabolized glutamine accounted for as much as 40- 
70% of the intracellular citrate, aspartate, malate and fumarate pools 
when cells were cultured in low oxygen (Fig. 3c). Given the marked 
reduction in PDH flux observed in hypoxia, we tested the ability of 
dichloroacetate (DCA) to restore PDH activity and mitigate the 
contribution of reductive metabolism to lipogenesis. DCA inhibits 
pyruvate dehydrogenase kinases (PDKs)”', and PDK1 is a known 
target of HIF-1o that inhibits the activity of PDH through phosphor- 
ylation”. Although DCA treatment had no observable effect on 
carbon use under normoxia, reductive glutamine metabolism was 
inhibited and glucose oxidation was partly restored in A549 cells 
cultured with DCA under hypoxia (Fig. 3d and Supplementary Fig. 16f), 
suggesting that hypoxia-induced PDK1 expression contributes to the 
use of reductive carboxylation for fatty-acid synthesis. 

The von Hippel-Lindau (VHL) tumour suppressor protein is fre- 
quently lost in renal cell carcinoma (RCC) and results in a state of 
‘pseudohypoxia’ by activating HIF signalling** °°. To understand further 
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Figure 3 | Reductive TCA metabolism increases under hypoxia. MRCS cells 
were cultured under normoxia or hypoxia for 3 days in the presence of tracer. 
a, Relative level of glucose oxidation as determined by M2 labelling from 
[U-'°Cg] glucose (see Supplementary Fig. 16a for atom transition map). M2 
isotopologues were the most abundant labelled metabolites in mass spectra. 
b, Relative abundance of citrate and «KG. c, Relative contribution of reductive 
glutamine metabolism to TCA metabolites, determined by M1 labelling from 
[1-'°C] glutamine. d, Contribution of glucose oxidation and glutamine 
reduction to lipogenesis in A549 cells cultured with or without 5mM DCA. 
Error bars, s.e.m. (a—c) (n = 3) and 95% confidence intervals (d). *P < 0.01 and 
**P < 0.001 comparing normoxia to hypoxia. ***P < 0.05 comparing control 
with DCA in hypoxia. 


the role of this pathway in promoting the switch to reductive TCA 
metabolism we tested RCC cells deficient in VHL using ISA. 
Remarkably, VHL-deficient RCC cell lines preferentially used reductive 
glutamine metabolism for lipogenesis, even when cultured under 
normal oxygen levels, whereas those expressing wild-type (WT) 
VHL behaved similarly to other carcinoma cell lines (Fig. 4a and 
Supplementary Fig. 17). Re-expression of WT VHL in previously 
VHL-deficient cell lines resulted in a switch back to oxidative glucose 
metabolism as the source of carbon for lipid synthesis (Fig. 4b), 
reduced extracellular fluxes of glucose, lactate and glutamine (Sup- 
plementary Fig. 18a), and increased the pool of intracellular citrate rela- 
tive to “KG (Supplementary Fig. 18b) under normoxia. Furthermore, 
shRNA-mediated knockdown of HIF-2« partly restored glucose- 
mediated lipogenesis in 786-O cells (Fig. 4c, d). Consistent with 
VHLand HIF influencing the switch to reductive glutamine metabolism 
during hypoxia, glucose entry into the TCA cycle by PDH was increased 
under normoxia upon introduction of WT VHL or knockdown of HIF- 
2a in 786-O cells (Supplementary Fig. 18c). Similar changes were 
observed after knockdown of the HIF-« dimerization partner ARNT 
(aryl hydrocarbon receptor nuclear translocator) in VHL-deficient 
normoxic UMRC2 cells, which express both HIF-l% and HIF-2a 
(ref. 27), or after ARNT knockdown in hypoxic A549 and 143B cells 
(Supplementary Fig. 19). 

Our results highlight an important role for reductive TCA metabolism 
of glutamine in cell proliferation at physiological oxygen levels 
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Figure 4 | HIF/ARNT/VHL signalling regulate carbon use for lipogenesis. 
a-c, Contribution of glucose oxidation ({U-Cg] glucose) and glutamine 
reduction ([5-'*C] glutamine) to lipogenesis in RCC lines (a), parental control 
(PRC3) and VHL+ (WTS) cells derived from 786-O line (b) or vector control 
(pTV) or HIF2a shRNA (pTR) cells derived from 786-O line (c). d, Western 
blot to determine HIF-2« levels for cells in b, c. Error bars, 95% confidence 
intervals obtained from ISA model. *P < 0.05. e, Model depicting the metabolic 
reprograming of mammalian cells by hypoxia or VHL loss to use reductive 
glutamine metabolism for lipogenesis. HIF stabilization drives transcription of 
PDK1, which decreases PDH activity and subsequently intracellular citrate 
levels. IDH1 and ACOI reductively generate lipogenic citrate from glutamine- 
derived «KG. DCA can inhibit PDKs, forcing increased glucose oxidation in 
hypoxic cells. 


(Fig. 4e). Given the almost exclusive use of reductive carboxylation for 
lipogenesis under hypoxia, a redundant or contributing role of mito- 
chondrial IDH2 in this pathway is probable; however, our data provide 
evidence that the reductive pathway involves IDH1-mediated catalysis 
in the cytoplasm. Although the carbon source that cells use for 
lipid synthesis appears to be determined, at least partly, by HIF- 
mediated regulation of PDK1, additional hypoxia-associated changes 
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may also promote reductive glutamine metabolism. For example, 
HIF-20 enhances c-MYC activity**, which in turn drives glutamine 
catabolism through the regulation of numerous genes including 
glutaminase”’. This metabolic reprogramming provides an effective, 
glucose-independent means of generating AcCoA for biosynthesis. 
Because glucose is also delivered to cells through the vasculature, it may 
be limited in microenvironments with decreased oxygen availability”. 
Reductively metabolizing amino acids for lipid synthesis under these 
conditions would allow cells to conserve glucose for production of 
ribose and other biosynthetic precursors (for example, one carbon pool, 
hexosamines) that are not typically generated from other nutrients. 
Thus, reductive metabolism may allow cells to distribute available 
nutrients more efficiently in poorly vascularized microenvironments. 
These results add a new dimension to our understanding of cell 
metabolism and suggest potential therapeutic targets along the reduc- 
tive carboxylation and glutamine catabolic pathways that could 
mitigate hypoxic tumour growth. 


METHODS SUMMARY 


For determination of steady-state labelling of polar metabolites, cells were cultured 
for approximately 24h in the presence of '*C-labelled glutamine or glucose before 
extraction. For experiments involving stable isotopic labelling of lipid biomass, 
cells were grown for approximately 3-5 days in the presence of tracer before 
extraction. Details of the extraction and derivatization methods are described in 
the Supplementary Methods. Computational determination of metabolic fluxes, 
confidence intervals, de novo lipogenesis and the contribution of tracers to fatty- 
acid carbon was accomplished using an in-house software package, Metran’*. 
Details of the metabolic networks and gas chromatography/mass spectrometry 
(GC/MS) measurements used for modelling and complete results are described in 
Supplementary Information. The generation of cells stably expressing control 
shRNAs or those targeted IDH1 or IDH2 is described in Supplementary 
Methods; all experiments were conducted within four passages of initial selection. 
Hypoxic microenvironments were generated by feeding incubators with a pre- 
mixed gas composed of 1% O2, 5% CO, and 94% N>, and O, levels were confirmed 
to range between 1 and 3% using a Fyrite combustion analyser. For details of 
recombinant IDH1 production and enzyme assays, T-cell activation, medium 
analysis, [5-'“C]glutamine experiment and western blotting, see Supplementary 
Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culture, isotopic labelling and hypoxia. All cell lines were cultured in 
Dulbecco’s modified Eagle medium (DMEM; Mediatech) containing 10% fetal 
bovine serum (FBS; Invitrogen) and 100UmI™' penicillin/streptomycin 
(Mediatech) unless otherwise mentioned. Cell lines were obtained from ATCC 
unless otherwise noted. The VC3 glioma cell line was provided by 
T. Lautenschlaeger, MDA-MB-231 and HCT116 cell lines were provided by 
F. Chiaradonna, MRCS cells were provided by S. Lippard, SN12C, ACHN and 
786-O cells were provided by K. Courtney and L. Cantley, MCF10A cells were 
provided by J. Brugge and Huh7 cells were provided by M. Hemann. MCFI0A 
cells were cultured in custom DMEM/F12 (Hyclone) containing 5% horse 
serum, 20ngml_' EGF, 10 ug ml’ insulin, 100 ng ml * cholera toxin, 0.5 mg 
ml’ hydrocortisone, 100 Uml’ penicillin/streptomycin, and labelled/unla- 
belled glucose and glutamine at 18 mM and 2.5 mM, respectively. PRC3, WT8, 
pTV and pTR cell lines were subcloned from the 786-O cell line as previously 
described***’. For isotopic labelling experiments, cells were cultured in six-well 
plates in glucose- and glutamine-free DMEM (Sigma) containing 10% dialysed 
FBS (Invitrogen), 100 U ml’ penicillin/streptomycin, naturally labelled glucose 
or glutamine, and the appropriate tracer, including fue.) glutamine (Isotec), 
[5-'°C] glutamine (C/D/N Isotopes), [1-'°C] glutamine, [U-'*Cg]glucose and 
[1,2-?C,]glucose (all from Cambridge Isotope Labs). Steady-state labelling of 
organic and amino acids was accomplished by culturing sub-confluent cells in 
tracer medium for 24h (Supplementary Fig. 20). Labelling of fatty acids for ISA 
was conducted over 3-5 days of culture in an excess of tracer medium (3-4 ml 
per well in a six-well plate) to prevent nutrient depletion. Hypoxic culture was 
conducted by feeding a custom mixture of 1% O2, 5% CO, and 94% N; to a 
standard incubator controlled at 5% CO . HEPES (Mediatech) was added to the 
culture medium at 20mM to maintain pH between normoxic and hypoxic 
cultures. The internal gas content was monitored using Fyrite gas analysers 
(Bacharach) for CO and Oz. O> levels were confirmed at 1-3% during hypoxic 
culture. 

Isolation and culture of primary CD8* T lymphocytes. Whole spleens were 
isolated from OT-1 mice (Jackson Laboratories) expressing ovalbumin 
(SIINFEKL)-specific T-cell receptors on CD8* T cells**. Spleens were passed 
through 40-um cell strainers, and red blood cells were removed by lysis (ACK 
buffer, Gibco). Splenocytes were magnetically enriched for CD8* T cells by nega- 
tive selection (Stem Cell Technologies) then suspended in DMEM containing 10% 
dialysed serum, 100 U ml’ penicillin/streptomycin and 1X MEM non-essential 
amino-acid supplement, and labelled/unlabelled glucose (25 mM) and glutamine 
(4mM). Cells were cultured for 6days in 12-well plates under hypoxia and 
expanded by addition of 10 pg ml” ' SIINFEKL peptide. Media was supplemented 
with 30U ml ! of recombinant interleukin-2 every 48h, and cells were main- 
tained at a concentration between 1 X 10° and 2.5 X 10° cells per millilitre by 
adding additional medium. To verify proliferation, cells were incubated with 
2 uM carboxyfluorescein succinimidyl ester (CSFE) in PBS + 2% FBS for 5 min, 
washed three times in buffer, re-suspended in media and cultured as above. 
Viability and T-cell enrichment were determined by flow cytometry after staining 
with 4’,6-diamidino-2-phenylindole (DAPI) and fluorescent anti-CD8 antibody. 
T-cell proliferation was quantified by flow cytometry using a FACSCanto II 
equipped with an High Throughput system (Becton Dickinson) and analysed 
using FlowJo 7.5.5 (TreeStar). 

Lentiviral-mediated generation of cells with knockdown of IDH1/2. Stable cell 
cultures with decreased IDH1 and IDH2 expression were generated by lentiviral- 
mediated shRNA expression. pLKO.1 lentiviral vectors targeting IDH1 had 
shRNA sequences of CCGGGCTGCTTGCATTAAAGGTTTACTCGAGTAAA 
CCTTTAATGCAAGCAGCTTTTT (IDH1a; TRCN0000027298) and CCGGCG 
AATCATTTGGGAATTGATTCTCGAGAATCAATTCCCAAATGATTCGTTT 
TT (IDH1b; TRCN0000027289), IDH2 had shRNA sequence CCGGGTGGA 
CATCCAGCTAAAGTATCTCGAGATACTTTAGCTGGATGTCCACTTTTT 
(TRCN0000027225). For controls, either non-targeting control shRNA (SHC002; 
Sigma) or pLKO.1 scrambled control vector** (Addgene) were used. pLKO.1 vector 
targeting ARNT with shRNA sequence CCGGGAGAAGTCAGATGGTTTATTT 
CTCGAGAAATAAACCATCTGACTTCTCTTTTT (TRCN0000003819) was 
obtained from Open Biosystems. HEK293T cells were co-transfected with 
pLKO.1 vectors and packaging plasmids to produce lentivirus. Filtered superna- 
tants were used for infection, and cells were selected with puromycin (2 tg ml’) for 
at least two passages before initiating tracer and flux experiments. 

Metabolite extraction and GC/MS analysis. At the conclusion of culture, cells 
were rinsed with 1 ml ice-cold PBS and quenched with 0.4 ml ice-cold methanol. 
An equal volume of water was added, and cells were collected in tubes by scraping 
with a pipette. One volume of ice-cold chloroform was added to each tube, and the 
extracts were vortexed at 4 °C for 30 min. Samples were centrifuged at 14,000g for 
5 min at room temperature. For analysis of polar metabolites, the aqueous phase 
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was transferred to a new tube for evaporation under airflow. For ISA experiments, 
the non-polar fraction was collected and evaporated under airflow. In some ISA 
experiments, cells were trypsinized, counted and pelleted before lipid extraction as 
described above. 

Dried polar metabolites were dissolved in 20 ll of 2% methoxyamine hydro- 
chloride in pyridine (Pierce) and held at 37 °C for 1.5h. After dissolution and 
reaction, tert-butyldimethylsilyl (TBDMS) derivatization was initiated by add- 
ing 30 pl N-methyl-N-(tert-butyldimethylsilyl)trifluoroacetamide (MBTSTFA) 
+ 1% tert-butyldimethylchlorosilane (TBDMCS; Pierce) and incubating at 
55 °C for 1h. Fatty-acid methyl esters were generated by dissolving and reacting 
dried chloroform fractions in 50-100 ul of Methyl-8 reagent (Pierce) and 
incubating at 60 °C for 1h. GC/MS analysis was performed using an Agilent 
6890 GC equipped with a 30m DB-35MS capillary column connected to an 
Agilent 5975B MS operating under electron impact ionization at 70 eV. One 
microlitre of sample was injected in splitless mode at 270 °C, using helium as the 
carrier gas at a flow rate of 1 ml min '. For measurement of organic and amino 
acids, the GC oven temperature was held at 100 °C for 3 min and increased to 
300 °C at 3.5° min”. For analysis of fatty-acid methyl esters, the GC temper- 
ature was held at 100°C for 5 min after injection, increased to 200°C at 15° 
min |, then to 250 °C at 5° min | and finally to 300 °C at 15° min™'. The MS 
source and quadrupole were held at 230°C and 150°C, respectively, and the 
detector was run in scanning mode, recording ion abundance in the range of 
100-605 m/z. Mass isotopomer distributions were determined by integrating 
the appropriate ion fragments listed in Supplementary Table 1. When required, 
mass isotopomer distributions were corrected for natural isotope abundance 
using in-house algorithms adapted from ref. 35. Relative metabolite abun- 
dances were measured using a norvaline internal standard and integrating all 
potentially labelled ions in the metabolite fragments listed in Supplementary 
Table 1. 

ISA and metabolic flux analysis. Computational estimation of fluxes or per- 
centage enrichment in the AcCoA pool and their associated 95% confidence 
intervals were accomplished using the elementary metabolite unit (EMU)-based 
software Metran, executed within Matlab (Mathworks) as previously 
described**°. Briefly, fluxes were determined iteratively by simulating MS 
measurements from a given flux vector and comparing with mass isotopomer 
distribution measurements (three biological replicates). Upon obtaining an 
acceptable fit, confidence intervals were determined for each flux using 
parameter continuation. Assumptions, network (including atom transitions), 
raw data and model fits are presented in Supplementary Information. ISA was 
performed in a similar manner using the simple network described in 
Supplementary Fig. 4 and Supplementary Table 6 to determine the tracer enrich- 
ment in lipogenic AcCoA (D value) and percentage of newly synthesized lipids 
and de novo lipogenesis, g(t) (ref. 15). Uncorrected mass isotopomer distribu- 
tions, fitted parameters and confidence intervals used for ISA are listed in 
Supplementary Information. 

Calculation of absolute flux of tracers to palmitate in biomass. The quantity of 
newly synthesized palmitate was determined by multiplying the fractional newly 
synthesized palmitate value (g(t) value from ISA) by the total cellular palmitate. 
Total cellular palmitate was quantified by GC/MS using a triheptadecanoin 
internal standard. Flux of a given tracer to palmitate was calculated by multiplying 
the tracer contribution (D value from ISA) by the amount of newly synthesized 
palmitate and dividing by the integral viable cell density over the course of the 
experiment. 

Detection of '*C incorporation into lipids. One hundred and fifty thousand 
A549 cells were plated in DMEM. After attachment, [5-'*C] glutamine was added 
to the medium at a final concentration of 1 Ciml™ ' After 72h of growth, lipids 
were extracted from plates using two rinses of 500 pil hexane:isopropanol (3:2). 
Three hundred microlitres of PBS was added to induce phase separation and 
collect the non-polar phase, and an additional 300 pl was added to the hexane 
fraction to rinse the non-polar fraction. The hexane fraction was dried under 
nitrogen gas, and the residue was dissolved in 200 ul chloroform before quantify- 
ing c.p.m. by liquid scintillation counting. Matched plates of cells were counted to 
determine cell number. 

Metabolite analysis of spent medium. Glucose, lactate, glutamine and glutamate 
concentrations were measured in fresh and spent medium samples using a Yellow 
Springs Instruments 7100. Cell number was determined using a hemocytometer. 
Extracellular flux measurements were calculated by assuming exponential growth 
over the culture period to determine integral viable cell density. 

Purification of recombinant IDH1. His-tagged IDH1 in pET41a was trans- 
formed into Escherichia coli (BL21 plysS DE3) and cells were grown with 
kanamycin selection to an Deo9 nm Of 0.6. The cells were then moved to 18°C 
and induced with 1 mM IPTG for 16-18 h, pelleted and subjected to freeze/thaw 
before re-suspension in 60 ml lysis buffer (20 mM Tris, pH 7.4, 0.1% Triton 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


X-100, 500 mM NaCl, 5mM f-mercaptoethanol, 10% glycerol, supplemented 
with protease inhibitors). Cells were lysed by sonication and protein bound to 
Ni-NTA agarose. The beads were batch washed three or four times with wash 
buffer (20mM Tris, pH 7.4, 500mM NaCl, 5mM f-mercaptoethanol, 10% 
glycerol), then eluted from a column in 1-ml fractions with elution 
buffer (20mM Tris, pH 7.4, 500mM NaCl, 5mM_ f-mercaptoethanol, 
500 mM imidazole, 10% glycerol). The first and second fractions containing 
most of the protein were dialysed into 50 mM Tris pH 7.5, 200 mM NaCl, 5 mM 
B-mercaptoethanol, 2mM MnCl), 10% glycerol and the recombinant enzyme 
was stored at —80°C. 

Recombinant IDH1 enzyme assays. All reactions were performed in reaction 
buffer (100mM Tris pH 7.5, 1.3mM MnCh, 200M NADPH and 2mM 
a-ketoglutarate), which was equilibrated overnight at 0%, 5% or 10% CO, as 
indicated. One hundred microlitres of reaction buffer for each CO, condition 
was added to 10,1g rIDH1 and activity was measured by following NADPH 
fluorescence (excitation at 340 nm, emission at 460 nm). 

SDS-PAGE and western blotting. Cells were rinsed with ice-cold PBS and 
lysed using RIPA buffer. Proteins were separated by SDS-PAGE and transferred 
to a nitrocellulose membrane. After blocking, membranes were probed with 
goat anti-IDH1 polyclonal antibody (Santa Cruz Biotechnology, sc49996), 
mouse anti-IDH2 monoclonal antibody (Abcam, ab55271), rabbit anti- 
HIF2« polyclonal antibody (Novus Biologicals, NB100-122), mouse anti- 
ARNT1 monoclonal antibody (BD Biosciences, 611079), mouse anti-f-actin 
mouse monoclonal antibody (Novus Biologicals, ab8226), mouse anti-tubulin 
monoclonal antibody (Sigma) or rabbit anti-tubulin antibody (Sigma). Protein 


was detected using horseradish-peroxidase-conjugated secondary antibodies 
and chemiluminescence. 
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Reductive carboxylation supports growth in tumour 
cells with defective mitochondria 
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Mitochondrial metabolism provides precursors to build macro- 
molecules in growing cancer cells’’. In normally functioning 
tumour cell mitochondria, oxidative metabolism of glucose- and 
glutamine-derived carbon produces citrate and acetyl-coenzyme A 
for lipid synthesis, which is required for tumorigenesis’. Yet some 
tumours harbour mutations in the citric acid cycle (CAC) or elec- 
tron transport chain (ETC) that disable normal oxidative mito- 
chondrial function*’, and it is unknown how cells from such 
tumours generate precursors for macromolecular synthesis. Here 
we show that tumour cells with defective mitochondria use 
glutamine-dependent reductive carboxylation rather than oxidative 
metabolism as the major pathway of citrate formation. This 
pathway uses mitochondrial and cytosolic isoforms of NADP*/ 
NADPH-dependent isocitrate dehydrogenase, and subsequent 
metabolism of glutamine-derived citrate provides both the acetyl- 
coenzyme A for lipid synthesis and the four-carbon intermediates 
needed to produce the remaining CAC metabolites and related 
macromolecular precursors. This reductive, glutamine-dependent 
pathway is the dominant mode of metabolism in rapidly growing 
malignant cells containing mutations in complex I or complex III of 
the ETC, in patient-derived renal carcinoma cells with mutations in 
fumarate hydratase, and in cells with normal mitochondria sub- 
jected to acute pharmacological ETC inhibition. Our findings reveal 
the novel induction of a versatile glutamine-dependent pathway 
that reverses many of the reactions of the canonical CAC, supports 
tumour cell growth, and explains how cells generate pools of CAC 
intermediates in the face of impaired mitochondrial metabolism. 
We first studied the metabolism of isogenic 143B human osteosarcoma 
cells that contained or lacked a loss-of-function mutation in ETC com- 
plex III (cytochrome b-cl complex). These cell lines were generated by 
depleting 143B mitochondrial DNA (mtDNA) and repopulating with 
either wild-type mtDNA or mtDNA containing a frameshift mutation 
in the gene encoding cytochrome b (CYTB: also known as MT-CYB), an 
essential complex III component’. Despite lack of respiration and com- 
plex III function in the mutants’, both wild-type (WT 143B) and CYTB- 
mutant (CYTB 143B) cells form colonies in soft agar’ and proliferate at 
comparable rates (Supplementary Fig. 1a), making these cells a good 
model in which to study growth during mitochondrial dysfunction. 
Both cell lines had detectable CAC intermediates, although citrate 
was less abundant and succinate was significantly more abundant in 
the CYTB 143B cells (Supplementary Fig. 1b). As expected for cells 
with defective oxidative phosphorylation, CYTB 143B cells had higher 
glucose consumption and lactate production than WT 143B cells, indi- 
cating a metabolic shift towards aerobic glycolysis (Fig. 1a). To deter- 
mine the effects of CYTB mutation on the metabolic fates of glucose, we 
cultured both cell lines in medium containing D [U-8C] glucose (here U 
indicates uniformly labelled) and measured ‘°C enrichment of intra- 
cellular metabolites by mass spectrometry. In WT 143B cells, most 


citrate molecules contained glucose-derived '*C (Fig. 1b). Citrate 
m-+2 (citrate containing two additional mass units from 13C) results 
from oxidative decarboxylation of glucose-derived pyruvate by 
pyruvate dehydrogenase (PDH) to form  [1,2-'*C]acetyl-coA 
(coA, coenzyme A), followed by condensation with an unlabelled 
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Figure 1 | A reductive pathway of glutamine metabolism in cancer cells 
lacking activity of ETC complex III. a, Glucose utilization, lactate secretion 
and glutamine utilization in WT 143B and CYTB 143B cells. b, Mass 
isotopomer analysis of citrate in cells cultured with p[U-'*C] glucose and 
unlabelled glutamine. c, d, Mass isotopomer analysis of citrate and fumarate in 
cells cultured with L[U-'*C] glutamine and unlabelled glucose. Data are the 
average + s.d. for three independent cultures. *P < 0.05; **P < 0.005, 
Student’s t-test. e, Schematic of glutamine metabolism in WT 143B and CYTB 
143B cells. Coloured arrows follow the paths of glutamine-derived carbon. 
Abbreviations: Ac-CoA, acetyl-CoA; OAA, oxaloacetate; Gln, glutamine; Glu, 
glutamate; «KG, «-ketoglutarate; Succ-CoA, succinyl-CoA; Fum, fumarate; 
Mal, malate; PDH, pyruvate dehydrogenase; ACL, ATP-citrate lyase; IDH, 
isocitrate dehydrogenase; ~KGDH, «-ketoglutarate dehydrogenase. 
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oxaloacetate (OAA). Processing of citrate m+2 around one turn of the 
CAC produces citrate m+4 (Supplementary Fig. 2). In CYTB 143B 
cells, most citrate contained no glucose carbon (m+0), indicating 
suppressed PDH contribution to acetyl-CoA (Fig. 1b). Fumarate and 
malate m+2 were also decreased (Supplementary Fig. Ic, d). 
Glutamine is a major respiratory substrate in cancer cells, providing 
energy and carbon for growth*”®. Such cells use glutamine to supply 
anaplerosis, the replenishment of CAC intermediates withdrawn from 
the cycle to feed biosynthetic pathways. Both WT and CYTB 143B cells 
require glutamine for colony formation, implying a respiration- 
independent function for glutamine in cell growth’, and both cell types 
consumed glutamine at similar rates (Fig. la). We cultured both cell 
lines with L[U-'*C] glutamine to define the metabolic fates of glutamine. 
Like other glutamine-dependent cancer cells'’, WT 143B cells used 
glutamine as the major anaplerotic precursor, resulting in a large 
amount of fumarate, malate and citrate m+4 (Fig. 1c, d; Supplemen- 
tary Figs le and 3, top). In contrast, CYTB 143B cells produced only 
trace quantities of citrate m+4. Instead, they produced citrate m+5 
through reductive carboxylation of glutamine-derived «-ketoglutarate 
(Fig. 1c). This reaction involves addition of an unlabelled carbon by 
isocitrate dehydrogenase (IDH) acting in reverse relative to the canonical 
oxidative CAC”. Cleavage of citrate m+5 by ATP-citrate lyase then 
produces acetyl-CoA m+2 and OAA m+3, with the unlabelled carbon 
retained on OAA (Supplementary Fig. 3, bottom). Examination of other 
CAC metabolites in CYTB 143B cells revealed that they were formed 
downstream of citrate m+5 and OAA m+3. These cells contained 
abundant malate and fumarate m+3, which was essentially absent 
from WT 143B cells (Fig. 1d and Supplementary Fig. le). Thus these 
metabolites are generated from reductive, rather than oxidative, 
glutamine metabolism (Fig. le). Succinate was formed through both 
reductive and oxidative glutamine metabolism in CYTB 143B cells 
(Supplementary Fig. 1f). Glutamine-dependent reductive carboxyla- 
tion was also observed in CYTB 143B cells cultured in medium con- 
taining a full complement of amino acids (Supplementary Fig. 4), and 
in cells with a genetic defect in ETC complex I (Supplementary Fig. 5). 
To determine which of the three IDH isoforms contributed to 
reductive carboxylation, we used RNA interference (RNAi) to silence 
expression of IDH1 (isocitrate dehydrogenase 1 (NADP*), soluble), 
IDH2 and IDH3 in CYTB 143B cells and measured glutamine-derived 
°C labelling in citrate. Silencing either of the NADP*’/NADPH- 
dependent IDH isoforms, IDH1 or IDH2, reduced citrate m+5 in cells 
cultured with L[U-'*C]glutamine and silencing both at once further 
reduced citrate m+5 (Fig. 2a, b). Conversion of glutamine to glutamate 
was unaffected (Fig. 2b). Silencing IDH3, the mitochondrial NAD*/ 
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Figure 2 | NADP*/NADPH-dependent isoforms of isocitrate 
dehydrogenase contribute to reductive carboxylation. a, Transient silencing 
of isocitrate dehydrogenase (IDH) proteins in CYTB 143B cells with short 
interfering RNAs (siRNAs) directed against [DH1, IDH2 or IDH3. An siRNA 
directed against luciferase (Luc) was used as a negative control. When IDH1 
and IDH2 siRNAs were transfected concurrently (IDH1+IDH2), a transfection 
with the same nanomolar amount of Luc siRNA (2X Luc) was used as a negative 
control. b, Mass isotopomer analysis of citrate and glutamate in CYTB 143B 
cells cultured with L[U-!°C] glutamine after silencing of IDH isoforms. Data are 
the average = s.d. of three independent cultures. *P < 0.05, Student’s t-test. 
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NADH-dependent isoform, had no effect on labelling (Fig. 2a, b). Thus 
reductive carboxylation in CYTB 143B cells involves IDH1 and IDH2, 
but not IDH3. Because IDH1 and IDH2 proteins are localized primarily 
in the cytoplasm and mitochondria, respectively, of both cell lines 
(Supplementary Fig. 6), the data suggest that reductive carboxylation 
in CYTB 143B cells may occur in either compartment. 

To determine whether IDH1 and IDH2 are required for growth, we 
stably silenced each enzyme using lentiviral RNAi. In WT 143B cells, 
silencing either isoform reduced cell growth (Supplementary Fig. 7), 
consistent with findings in other cancer cells with normal mitochon- 
dria'*. In CYTB 143B cells, a similar growth reduction occurred when 
either isoform was silenced (Supplementary Fig. 7). Thus both IDH1 
and IDH2 support growth in cells that use glutamine-dependent 
reductive carboxylation. 

We next determined whether reductive glutamine metabolism con- 
tributed to lipogenesis, a biosynthetic activity required for cancer cell 
growth. Both WT and CYTB 143B cells produced fatty acids de novo, as 
indicated by incorporation of 30 into lipids (Fig. 3a). The two cell 
lines were then cultured with ‘C-labelled lipogenic substrates to 
examine carbon preferences for lipogenesis (Fig. 3a). Both cell lines 
had an intact pathway oflipogenesis from exogenously supplied acetate. 
The enhanced labelling in CYTB 143B cells probably resulted from an 
increased specific activity of the intracellular two-carbon pool because 
of suppressed PDH. Lipids derived from CYTB 143B cells contained 
little radioactivity from glucose, but abundant radioactivity from 
glutamine, indicating a switch in the extent to which these nutrients 
supplied lipogenesis. To quantify the contribution of glucose and 
glutamine to lipogenic acetyl-CoA, cells were cultured in medium 
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Figure 3 | Glutamine is the major lipogenic precursor in cells lacking 
oxidative CAC function. a, Cells were cultured in medium containing : H,0 
(21 vol.%), or “C tracers of acetate (ac, 5 uCi), glucose (glc, 10 Ci) or 
glutamine (gln, 10 \1Ci). Lipids were analysed for °H or ‘*C content by 
scintillation counting. In the '“C experiments, raw counts per 10° cells were 
normalized to the absolute rate of fatty acid synthesis established using *H,O. 
*P < 0.05, Student’s t-test. b, Schematic outlining synthesis of fatty acids (FA) 
from acetyl-CoA (Ac-CoA) and the source of the triplet and doublet at @ — lin 
3C NMR spectroscopy. ¢, d, °C NMR of lipids labelled with p[U-'°C] glucose 
(c) or L[U-°C] glutamine (d). Insets are expansions of the @ — 1 resonance. 
Abbreviations: s, singlet; d, doublet; t, triplet. 
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containing either p[U-'*C] glucose and unlabelled glutamine, or 
L[U-'°C]glutamine and unlabelled glucose. Extracted lipids were 
analysed by °C NMR. Because fatty acids are synthesized by the 
sequential addition of two-carbon units, analysing ‘*C-"*C coupling 
between the @ — 1 and w — 2 carbons—which enter the fatty acyl 
chain on different two-carbon units—enables calculation of '°C 
enrichment in lipogenic acetyl-CoA’. In the m—1 multiplet, 
®C-C coupling produces a triplet when two '’C-labelled two- 
carbon units are added in succession, whereas one labelled and one 
unlabelled unit produce a doublet (Fig. 3b). Therefore '*C labelling in 
lipogenic acetyl-CoA is proportional to the contribution of the triplet 
to the overall area of the @ — 1 multiplet. In WT 143B cells cultured 
with p[U-°C] glucose, the @ — 1 resonance was dominated by the 
triplet, which accounted for 65% of the total area (Fig. 3c, left). 
Glutamine was a minor source of fatty acyl carbon, as the triplet 
accounted for only 15% of the area when the cells were cultured with 
L[U-'?C] glutamine (Fig. 3d, left). This pattern was reversed in the 
CYTB 143B cells, with glutamine providing 67% of the lipogenic carbon 
(Fig. 3c, d, right; Supplementary Fig. 8a). Thus, reductive glutamine 
metabolism was the major pathway of fatty acid production in newly 
synthesized lipids during CYTB 143B growth. 

Although most cancer cells have functional mitochondria, a subset 
of human tumours harbours mutations that impair mitochondrial 
metabolism. Two major classes of mutations occur in genes required 
for function of the CAC enzymes succinate dehydrogenase (SDHA, 
SDHB, SDHC, SDHD, SDHAF2) or fumarate hydratase (FH). To test 
whether glutamine-dependent reductive carboxylation occurs in 
tumour cells harbouring these mutations, we studied UOK262 cells 
derived from a renal tumour ina patient with hereditary leiomyomatosis 
renal cell carcinoma syndrome. In this syndrome, affected individuals 
inherit one loss-of-function FH allele and their tumours display loss of 
heterozygosity for the FH locus’*. UOK262 cells are defective in respira- 
tion and devoid of FH activity’’. They failed to survive culture in glucose- 
free medium containing galactose and glutamine (Fig. 4a), a condition 
that forces cells to use mitochondrial metabolism to generate ATP’®. Lack 
of FH activity precludes the use of glutamine as a carbon source in the 
conventional oxidative CAC. Yet the UOK262 cells required glutamine 
to proliferate (Fig. 4b). Culture of these cells with p[U-'*C] glucose 
produced mass isotopomer data similar to CYTB 143B cells, with 
minimal formation of citrate m+ 2 (Fig. 4c, left). In cells cultured with 
L[U-°C] glutamine, the most abundant mass isotopomer of citrate was 
m+5 formed by reductive carboxylation (Fig. 4c, left). Fumarate was 
predominantly formed through oxidative glutamine metabolism, 
producing fumarate m+4 (Fig. 4c, middle). However, a large fraction 
of the malate pool (m+3) was formed via citrate cleavage and sub- 
sequent reductive metabolism (Fig. 4c, right). Lipid labelling using 
p[U-°C} glucose or L[U-°C] glutamine revealed that, like CYTB 
143B cells, glutamine is the major carbon source for fatty acid 
synthesis (Supplementary Fig. 8b). Thus these cancer cells produce 
CAC intermediates using two distinct pathways to compensate for the 
loss of FH enzyme activity: oxidative metabolism of glutamine to 
fumarate, and glutamine-dependent reductive carboxylation to pro- 
duce citrate, acetyl-CoA for lipogenesis and remaining four-carbon 
CAC intermediates. 

Finally, we tested whether glutamine-dependent reductive carboxy- 
lation was unique to cells with permanent abnormalities in mitochon- 
drial metabolism. WT 143B cells and transformed mouse embryonic 
fibroblasts (MEFs) were subjected to acute ETC inhibition. Exposure to 
inhibitors of complex III (antimycin) or complex I (rotenone) induced 
a switch from oxidative to reductive metabolism of glutamine (Fig. 4d, 
Supplementary Fig. 9a). Enhanced reductive carboxylation was 
evident one hour after ETC inhibition and was not eliminated by 
pre-treatment with the protein synthesis inhibitor cycloheximide (Sup- 
plementary Fig. 9b, c). Metformin, an antidiabetic agent that inhibits 
respiration through effects on complex I’’, also induced glutamine- 
dependent reductive carboxylation at high doses (Fig. 4d, Supplementary 
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Figure 4 | Glutamine-dependent reductive carboxylation in renal 
carcinoma cells lacking fumarate hydratase and during ETC inhibition in 
tumour cells with normal mitochondria. Growth of UOK262 renal 
carcinoma cells in complete medium, in medium in which glucose was replaced 
with galactose (a), and in medium lacking glutamine (b). c, Mass isotopomer 
analysis of citrate, fumarate and malate from UOK262 cells cultured in medium 
containing p[U-'*C] glucose and unlabelled glutamine, or L[U-'*C]glutamine 
and unlabelled glucose. d, Mass isotopomer analysis for citrate, fumarate and 
malate from WT 143B cells cultured in medium containing unlabelled glucose 
and L[U-'°C]glutamine. The labelled glutamine was introduced at the same 
time the cells were treated with vehicle (DMSO), metformin, rotenone or 
antimycin. Metabolites were extracted 6 h later. In all experiments, data are the 
average + s.d. of three independent cultures. 


Fig. 9a). Interestingly, doses that mimicked plasma concentrations in 
patients with acute metformin toxicity and lactic acidosis induced 
reductive glutamine metabolism (Supplementary Fig. 10). These data 
demonstrate that glutamine-dependent reductive carboxylation is a 
common cellular response to impaired mitochondrial metabolism. 

Glutamine-dependent reductive carboxylation has been described 
previously as a minor source of isocitrate/citrate and lipogenic carbon 
in mammalian cells'*"*'*°. In Plasmodium falciparum, a unicellular 
protozoan with limited respiration, the pathway is a major source of 
citrate and acetyl units*’. We show that this pathway can also function 
as the predominant metabolic strategy to produce lipids for cancer cell 
growth, and can be induced acutely during ETC inhibition. We specu- 
late that reductive carboxylation is stimulated by a disturbance in the 
redox ratio of the mitochondria caused by ETC impairment, decreas- 
ing the NAD*/NADH ratio and rendering oxidative function of the 
CAC less efficient. This ratio could be dissipated in part by NAD(P)- 
transhydrogenases, which transfer reducing equivalents from NADH 
to NADPH, and this in turn may drive NADPH-dependent reductive 
carboxylation by IDH1 and IDH2. This redirection of the CAC 
extends the versatility of glutamine metabolism in cancer cell growth. 
In cells with normal mitochondria, glutamine oxidation is frequently 
the major source of anaplerosis, providing most of the carbon to pools 
of CAC intermediates and facilitating production of macromolecular 
precursors in the mitochondria”. Glutamine-dependent reductive 
carboxylation enables glutamine to retain this status as a growth- 
promoting nutrient even when mitochondrial metabolism is impaired. 
Therefore, during malignant transformation the acquisition of somatic 
mutations that impair mitochondrial function and stimulate aerobic 
glycolysis (the Warburg effect) does not diminish the importance of 
glutamine metabolism or of the mitochondria themselves for tumour 
cell proliferation. 
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METHODS SUMMARY 

WT 143B, CYTB 143B, VOK262, CCL16-B2 and CCL16-NDII cells were cultured 
as described”’***. MEFs were cultured in Dulbecco’s modified Eagle medium 
(DMEM) with 10% fetal bovine serum (FBS, Hyclone), penicillin/streptomycin 
and 6 mM L-glutamine. Cell growth was monitored in sub-confluent cultures by 
trypsinizing and counting with a haemacytometer. Glucose, lactate and glutamine 
were measured as described”. Isotopic labelling was performed in DMEM with 
10% dialysed FBS supplemented with either 15mM p[U-'*C]glucose or 2mM 
L[U-'?C] glutamine. Aqueous metabolites were analysed using an Agilent 6970 gas 
chromatograph and an Agilent 5973 mass selective detector. To determine relative 
metabolite abundance across samples, the area of the total ion current peak for the 
relevant metabolite was compared to that of an internal standard (2-oxobutyrate) 
and normalized for protein content. Analysis of '*C enrichment and mass 
isotopomer distribution was performed as published'!”’. For radioisotope lipid 
synthesis assays, cells were cultured for 18 h in complete medium supplemented with 
3H,0 or C tracers (Perkin Elmer). Lipids were extracted and analysed as described’. 
For lipid synthesis experiments using NMR, cells were plated in 10-cm dishes and 
allowed to proliferate exponentially in medium containing p[U-'C]glucose and 
unlabelled glutamine, or unlabelled glucose and t[U-’*C] glutamine until two con- 
fluent 15-cm dishes were obtained. Lipids were then extracted and analysed by ‘°C 
NMRas described’. Gene silencing was performed using commercial small inter- 
fering RNAs (Thermo) directed against IDH1, IDH2 or IDH3. Cells were trans- 
fected with DharmaFECT transfection reagent (Thermo) and protein abundance 
was examined after 72h using western blots with commercial antibodies (Santa 
Cruz Biotechnology). Cells were incubated with t[U-'*C] glutamine for two hours 
before extraction of metabolites for mass isotopomer distribution analysis. Stable 
silencing used lentiviral shaRNAs from the Mission shRNA pLKO.1-puro library 
(Sigma). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culture, growth and viability. WT 143B, CYTB 143B, CCL16-B2, CCL16- 
NDI and UOK262 cells were cultured as described”***. Mouse embryonic fibro- 
blasts (MEFs) were cultured in Dulbecco’s modified Eagle medium (DMEM) with 
10% fetal bovine serum (FBS, Hyclone), penicillin/streptomycin and 6mM 
L-glutamine. Doubling times were calculated by repeatedly allowing sub-confluent 
cultures to reach confluence, then trypsinizing, counting and re-plating at low 
confluence. 

Nutrient utilization and secretion. Concentrations of glucose, lactate and 
glutamine in tissue culture medium were measured using a NOVA BioProfile 
Basic4 chemical analyser as described**. Rates of consumption/secretion were 
calculated by determining the change in metabolite concentration over 6h, con- 
verting this change into nmoles consumed or secreted, and normalizing to protein 
content. 

Mass isotopomer distribution and mass spectrometry. Nutrients labelled with 
°C were purchased from Cambridge Isotope Laboratories. All '°C studies were 
performed in medium containing 10% dialysed FBS and prepared so that 100% of 
either the glucose or glutamine pool was labelled with '*C, and the other pool was 
unlabelled. DMEM lacking both glucose and glutamine was prepared from powder 
(Sigma), then supplemented with either 15 mM p[U-'3C] glucose and 2mM un- 
labelled glutamine, or 15mM unlabelled glucose and 2 mM 1[U-"*C]glutamine. 
For experiments with RPMI (Supplementary Fig. 4), we supplemented glutamine- 
free RPMI (11 mM unlabelled glucose, Invitrogen) with 2 mM 1[U-'*C]glutamine. 
Cells were grown in 60- or 100-mm dishes until 80% confluent, then rinsed with 
PBS and cultured with '*C-containing medium for the indicated time (usually 6 h). 
The cells were then extracted by freeze-thawing three times in 0.5 ml of a cold 
1:1 mixture of methanol:water. Macromolecules and debris were removed by 
centrifugation, an internal standard was added (50 nmol of 2-oxobutyrate), and 
the supernatants with aqueous metabolites were evaporated completely and deri- 
vatized for 30 min at 42°C in 100 ul of a trimethylsilyl donor (Tri-Sil, Thermo). 
Metabolites were analysed using an Agilent 6970 gas chromatograph networked to 
an Agilent 5973 mass selective detector. Retention times and mass fragmentation 
signatures of all metabolites were validated using pure standards. To determine 
relative metabolite abundance across samples, the area of the total ion current peak 
for the metabolite of interest was compared to that of the internal standard and 
normalized for protein content. The mass isotopomer distribution analysis mea- 
sured the fraction of each metabolite pool that contained every possible number of 
13C atoms; that is, a metabolite could contain 0, 1,2, ...1 *C atoms, where n = the 
number of carbons in the metabolite. For each metabolite, an informative fragment 
ion containing all carbons in the parent molecule was analysed using MSDChem 
software (Agilent), integrating the abundance of all mass isotopomers from m+0 
to m+n, where m= the mass of the fragment ion without any BC. A list of 
fragment ions is in ref. 11. The abundance of each mass isotopomer was then 
corrected mathematically to account for natural abundance isotopes” and finally 
converted into a percentage of the total pool. 

Lipid synthesis assays. All radioisotopes were from Perkin-Elmer. Culture with 
°H,O was used to measure total lipid synthesis by preparing high-glucose DMEM 
from powder (Sigma) and supplementing with 2 mM L-glutamine and 10% FBS. 
The *H,O was added during preparation of this medium to a final content of 21% 
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of the total volume. The '4C assays also used high-glucose DMEM with 2mM 
L-glutamine, 10% FBS and tracer quantities of [1,2-'*C]acetate (5 Ci, 4.4 uM), 
p[U-"C] glucose (10 Ci, 1.7 uM) or L[U-"*C] glutamine (10 pCi, 1.9 uM). In all 
radioisotope assays, cells were grown to 80-90% confluence then cultured for 18h 
in medium containing the label. Lipids were then extracted and analysed by 
scintillation counting as previously described*. For '*C NMR experiments, cells 
were plated at approximately 50% confluence in 10-cm dishes and allowed to 
proliferate exponentially in complete DMEM containing either 15mM 
p[U-'*C]glucose and 2mM unlabelled glutamine, or 15mM unlabelled glucose 
and 2mM L[U-'*C] glutamine until two confluent 15-cm dishes were obtained. 
Glucose and glutamine concentrations were monitored in the medium to ensure 
that neither nutrient was depleted during expansion of the culture. Lipids were 
then extracted and analysed by '*C NMR as described? using a Varian ANOVA 
14.1 T spectrometer equipped with a 3-mm broadband probe with the observe coil 
tuned to '°C (150 MHz). Relevant peak areas were determined using ACDLabs 
SpecManager (Advanced Chemistry Development). 

RNA interference. Transient gene silencing experiments were performed using 
commercial siRNA pools (siGENOME, Thermo) directed against IDH1, IDH2, 
IDH3 (catalogue numbers MU-008294-00, MU-004013-00 and MU-008753-00, 
respectively). For the simultaneous silencing of IDHI and IDH2, individual 
siRNAs rather than pools were used (IDH1: 5'-GGACUUGGCUGCUUGC 
AUU-3’; IDH2: 5'-CAAGAACUAUGACGGAGAU-3’). Cells were transfected 
with DharmaFECT transfection reagent (Thermo), and protein abundance was 
examined after 72h using western blots with commercial antibodies against 
IDH1, IDH2 or IDH3 (Santa Cruz Biotechnology). Cells were incubated with 
L[U-'?C] glutamine for two hours before extraction of metabolites for mass 
isotopomer distribution analysis. Stable silencing used lentiviral-mediated 
shRNAs from the Mission shRNA pLKO.1-puro library (Sigma). Cells were 
infected with each lentivirus using the manufacturer-supplied protocol, and pools 
of puromycin-resistant cells were used for subsequent experiments. Pools with the 
best silencing of IDH1 or IDH2 were used in the growth assay. 

Subcellular localization studies. WT 143B and CYTB 143B cells were transfected 
with siRNA as described above. After 72 h, the transfected cells were incubated in 
medium containing 1 1M Mitotracker Deep Red FM (Invitrogen) for 30 min. The 
cells were then fixed with 3.7% formaldehyde in PBS for 15 min, permeabilized 
with 0.5% (v/v) Triton X-100 for 5 min, and blocked with 1% bovine serum 
albumin for 5 min. Immunofluorescence used commercial antibodies against 
IDH1 (Santa Cruz Biotechnology) or IDH2 (Abcam) followed by anti-goat or 
anti-mouse secondary antibodies conjugated to Alexa Fluor-488. Cells were 
stained with primary and secondary antibodies for one hour at room temperature 
and mounted on Vectashield mounting medium containing DAPI (Vector 
Laboratories). Images were acquired using an LD Plan-Neofluar 40X/1.3 DIC 
objective on a Zeiss Axioplan 2E imaging deconvolution microscope. 
Mitochondrial fractions were isolated by differential centrifugation as previously 
described”®. 


26. Frezza, C., Cipolat, S. & Scorrano, L. Organelle isolation: functional mitochondria 
from mouse liver, muscle and cultured fibroblasts. Nature Protocols 2, 287-295 
(2007). 
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Differential oestrogen receptor binding is associated 
with clinical outcome in breast cancer 


Caryn S. Ross-Innes', Rory Stark', Andrew E. Teschendorff*, Kelly A. Holmes’, H. Raza Ali’®, Mark J. Dunning’, 
Gordon D. Brown!, Ondrej Gojis**°, Ian O. Ellis°, Andrew R. Green®, Simak Ali*, Suet-Feung Chin', Carlo Palmieri’, 


Carlos Caldas’”*? & Jason S. Carroll’ 


Oestrogen receptor-a (ER) is the defining and driving transcrip- 
tion factor in the majority of breast cancers and its target genes 
dictate cell growth and endocrine response, yet genomic under- 
standing of ER function has been restricted to model systems’. 
Here we map genome-wide ER-binding events, by chromatin 
immunoprecipitation followed by high-throughput sequencing 
(ChIP-seq), in primary breast cancers from patients with different 
clinical outcomes and in distant ER-positive metastases. We find 
that drug-resistant cancers still recruit ER to the chromatin, but 
that ER binding is a dynamic process, with the acquisition of 
unique ER-binding regions in tumours from patients that are 
likely to relapse. The acquired ER regulatory regions associated 
with poor clinical outcome observed in primary tumours reveal 
gene signatures that predict clinical outcome in ER-positive disease 
exclusively. We find that the differential ER-binding programme 
observed in tumours from patients with poor outcome is not due to 
the selection of a rare subpopulation of cells, but is due to the 
FOXA1-mediated reprogramming of ER binding on a rapid time- 
scale. The parallel redistribution of ER and FOXA1 binding events 
in drug-resistant cellular contexts is supported by histological 
co-expression of ER and FOXA1 in metastatic samples. By 
establishing transcription-factor mapping in primary tumour 
material, we show that there is plasticity in ER-binding capacity, 
with distinct combinations of cis-regulatory elements linked with 
the different clinical outcomes. 

Recent technological advances have allowed mapping of ER- 
binding events, with the goal of discovering the cis-regulatory elements 
and factors involved in mediating ER binding and transcription. 
Several genome-wide maps of ER in breast cancer cell-line models 
exist’, all showing that most ER-binding events occur at distal cis- 
regulatory elements. Forkhead motifs are enriched within the regions 
bound by ER binding and multiple studies have identified the forkhead 
protein FOXA1 as an important pioneer factor for ER-chromatin 
interactions* °. However, the ER mapping studies have been restricted 
to breast cancer cell lines, mostly the MCF-7 cell line. We sought to 
interrogate ER-binding events, for the first time, in primary frozen 
breast cancer samples, to determine if ER binding is dynamic and if 
specific cis-regulatory elements can distinguish tumours from patients 
with distinct clinical outcomes. 

ER ChIP-seq was performed in eight ER*, progesterone receptor 
(PR)*, HER2 primary breast tumours, representative of tumours 
from patients with better prognosis’, a conclusion supported by the 
available long-term clinical follow-up (Supplementary Fig. 1). Also 
included were seven primary breast tumours from patients with a poor 
outcome (ER* PR  HER2° or ER‘ PR* HER’), because PR” or 
HER2* tumours are more likely to be aggressive*’. As expected, the 
poor outcome patients who had long-term clinical follow-up died of 


breast cancer (Supplementary Fig. 1). Furthermore, three ER” distant 
metastatic samples from women with breast cancer were included. The 
metastatic locations and sample preparation can be found in Sup- 
plementary Fig. 1. As a control, we included two breast cancer samples 
that were ER’ (ER-« negative), but expressed high transcript levels of 
ER-B. 

ER ChIP-seq was conducted and ER-binding peaks were called 
using two different algorithms, MACS"? and SWEMBL (http://www. 
ebi.ac.uk/~swilder/SWEMBL/), to minimize peak caller bias. The 
number of sequencing reads and ER-binding events for each tumour 
is shown in Supplementary Fig. 2. ER binding could be mapped in all 
tumours, but total peak intensity and the number of identified binding 
events differed. Three tumours were split into two sections and ER 
ChIP-seq was conducted on the separate sections. We found very good 
concordance when comparing different sections of the same tumour 
(R* = 0.954) suggesting that tumour heterogeneity did not substan- 
tially influence the ER-binding signal obtained from a sample (Sup- 
plementary Fig. 3). 

We initially assessed whether a conserved set of breast cancer ER- 
binding events could be identified. We found a core set of 484 ER- 
binding events that were identified in at least 75% of all the tumours, 
but not in either of the ER” tumours (Fig. 1a). Peak calling details can 
be found in Supplementary Fig. 4. An example of a core ER-binding 
event is shown in Fig. 1b. This demonstrates that ER binding to chro- 
matin still occurs even in tumours that are unlikely to respond to anti- 
oestrogen therapies, implying that drug resistance is not due to loss of 
ER binding to DNA. The average ER-binding signal intensity was 
highest in the metastatic samples and lowest in patients with good 
outcome tumours, a phenomenon observed both within the 484 core 
ER-binding regions (Fig. 1c) and globally (Supplementary Fig. 4). 
These data indicate that there is an acquisition of binding signal intensity 
in tumours that progress towards a poorer prognosis and ultimately 
metastasize. The only DNA motif found to be enriched in the core ER- 
binding events was an oestrogen responsive element (ERE) (Fig. 1d). 
The genes near (within 20 kilobases (kb): an optimal window between 
ER-binding events and target genes'') the 484 core ER-binding events 
exhibited elevated expression in the ER* tumours used for ChIP-seq, 
as compared to all other genes (data not shown) and were higher in 
ER* tumours relative to ER” tumours in nine independent data sets 
(Supplementary Fig. 5). The genes are provided in Supplementary Fig. 6 
and include classic ER target genes such as TFF1, GREB1 and RARA. A 
gene predictor was generated based on genes near the core ER-binding 
events. Patients were stratified and the tumours with the highest ‘risk 
index’ had a poor clinical outcome when compared to the tumours with 
the lowest ‘risk index’ (Fig. le shows the results based on one study’ and 
additional data sets are shown in Supplementary Fig. 6; only ER* 
patients were considered). These conserved cis-regulatory elements 
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Figure 1 | A subset of ER-binding events is conserved in primary breast 
tumours and distant metastases. a, Heat map showing binding peak intensity 
of 484 core ER-binding events that are common to primary breast tumours and 
distant metastases. The window represents +5 kb regions from the centre of the 
binding events. b, Example of an ER-binding event at the RARA locus, which is 
present in all primary ER™ breast tumours and metastases, but not the ER™ 
tumours. ¢, Normalized average signal intensity of all core ER-binding events. 
Also included are the ER’ tumours. d, Motif analysis revealed the enrichment of 
EREs. e, Genes within 20 kb of the core ER-binding events were used to generate a 
gene predictor that was tested in independent data sets for predictive value. 
Tumours were stratified according to expression of this gene signature and the top 
one third and bottom one third of tumours were compared. Results from one data 
set are shown” and additional data sets are provided in Supplementary Fig. 6. 


390 | NATURE | VOL 481 | 19 JANUARY 2012 


a b 
n Zio] “ —— 
i 8 o 8 
6 5 il ne 
. ‘= 8 g 
£ 3 84 
8 |] 2 | 
3 z s | 
a a : a ; 
i ‘. Clee 1 
£ & H i a ' : 
a 8 i ' 
-0.1 00 01 0203 04 © glug : H Feral 
UD ~ ; 
Principal component 1 ag = : : ; _ ' 
@ Good outcome = 24 £ : ; 
@ Poor outcome 2 i = H ; 
@ Metastases a i : a3 
S 04 o> || >= 38 
d Poor outcome and metastases Good Poor Mets Good Poor Mets 
L J L J 
g 1.0 + “Gian + Enriched in good Enriched in poor 
rs 08 he He ates outcome tumours outcome tumours 
a , \. (599 ER events) and metastases 
£ 06 Hh ’ (1,192 ER events) 
2 oa HH 
3 | CP =3x108 To 
8 
® 0.2 = 
e — Top 33% (n = 88) © Motifs in poor outcome ER-binding events 
¢ ~~ Bottom 33% (n = 88) 
& 0.0 
D I T T T 
a (e) 5 10 15 


ERE agit TS.6Cr 


Time (years) 


Good outcome 


£1044... Be Is] | TA Ixfs 
5 an . ee sia ; = ed 
oS ete pravests 
3 0:67 ? — Motifs in good outcome ER-binding events 
a 04 THe 
£ P=3x10% 
3 ERE ¢ 
& 0.24 — Top 33% (n = 88) CAs 1Ga = 
5 ——~ Bottom 33% (n = 88) 
% 0.04 
fa) T T T 

0 5 10 15 


Time (years) 


Figure 2 | ER-binding profiles can discriminate between tumours from 
patients with different clinical outcomes. a, Principal component analysis 
(44% of total variance) of the 1,791 ER-binding events that can discriminate 
between the patients with good outcome tumours and those with poor/met 
tumours. Included are the replicates from three tumours, which are highlighted. 
b, Box plot representing distribution of normalized read counts in differential 
ER-binding events that are statistically enriched in either the patients with good 
outcome tumours (599 ER-binding events) or the patients with poor outcome 
tumours and metastases (Mets) (1,192 ER-binding events). The samples were 
pre-normalized. **P <1 10 '°. Pvalues were calculated using the Wilcoxon 
rank sum test. c, Enriched motifs in the poor/met and the good outcome ER- 
binding events. d, Genes within 20 kb of the differentially bound ER-binding 
events were used to generate a gene predictor. Tumours were stratified 
according to expression of this gene signature and the top one third and bottom 
one third of tumours were compared. Results from one data set are shown” and 
additional data sets are provided in Supplementary Fig. 12. 


and their putative target genes may be the elements that contribute to 
tumorigenesis and are maintained regardless of the clinical outcome of 
the breast cancer patient. In contrast to the primary breast cancers and 
metastases, we mapped ER binding in three normal human mammary 
glands and two normal human liver samples and found limited numbers 
of ER-binding events, with almost no concordance in ER binding 
between individuals (data not shown). 

We sought to determine if differential ER-binding events could 
discriminate the patients with good outcome (ER’ PR* HER2> 
tumours), from patients with poor outcome or metastases (we 
described the combined set as poor/met tumours). After normaliza- 
tion of the data to account for global differences in ER binding, dif- 
ferential binding analysis (DBA) was used to identify ER-binding 
events that were statistically enriched in one category or the other. 
This resulted in a set of ER-binding events that could discriminate 
between the two groups when using principal component analysis 
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(Fig. 2a). In total, DBA revealed 1,192 genomic regions that had sig- 
nificantly more ER binding in the poor/met group compared to the 
good outcome patients (Fig. 2b), and 599 ER binding regions with 
more ER binding in the good outcome patients when compared to 
the poor/met patients (Fig. 2b). The clustering of the tumours on the 
basis of the 1,791 differential ER-binding events can be visualized in 
Supplementary Fig. 7. These findings suggest that there are specific and 
re-occurring cis-regulatory elements that are occupied by ER in breast 
cancers, but that these are different in tumours that respond to treat- 
ment versus those that relapse and metastasize. Analysis of enriched 
DNA motifs identified the presence of ERE and FOXA1 motifs in the 
differential poor outcome ER-binding events and ERE motifs in the 
good outcome ER-binding events (Fig. 2c). Correlation of the poor 
outcome ER-binding events with known processes revealed an asso- 
ciation with endocrine resistance and luminal B status (Supplementary 
Fig. 8). 

To investigate if the genes near the differential ER-binding events were 
potentially functional in breast cancer, we analysed genes within a 20 kb 
window around the 1,192 poor/met and 599 good outcome ER-binding 
events. Using a training set, we generated a gene expression predictor for 
each of the good and poor outcome gene lists. The probability calculation 
and comparisons between the good and poor outcome genes is shown 
in Supplementary Figs 9 and 10. Within the poor outcome gene list was 
the oncogene ERBB2 (all genes are shown in Supplementary Fig. 11). 
As expected, genes in the poor outcome predictor were preferentially 
upregulated in poor outcome patients, whereas those in the good out- 
come predictor were preferentially downregulated (Supplementary 
Fig. 10). We next tested the predictors in an independent large cohort 
of breast cancer patients'’, only considering ER* tumours. Using 
distant-metastasis-free survival as an endpoint, both gene sets predicted 
outcome (P = 3 X 10 ° for good and 3 X 10° * for poor outcome genes) 
in this data set’? and with the expected opposite directionality (Fig. 2d). 
The gene predictors were associated with survival in additional data sets 
(Supplementary Fig. 12) and were largely independent of histopatho- 
logical factors (Supplementary Fig. 13). We tested 1,000 randomiza- 
tions from the entire list of genes and determined that the probability 
that a random set of genes would yield an equally robust predictor of 
clinical outcome was P = 0.004. Furthermore, the good and poor gene 
predictors had no predictive power in four cohorts of ER” patients 
(Supplementary Fig. 14). This suggests that the increased ER binding 
at distinct cis-regulatory elements is functionally and biologically 
relevant, resulting in altered gene expression profiles that contribute 
to differences in drug response and overall survival. 

To validate the findings made in the tumours, we explored the 
possibility that ER-binding events were acquired in cell-line models 
of endocrine resistance. ER binding was mapped by ChIP-seq in three 
commonly used tamoxifen-responsive, ER™ breast cancer cell lines 
(MCF-7, T-47D and ZR75-1), and two tamoxifen-resistant, ER* 
breast cancer cell lines, namely a tamoxifen-resistant MCF-7 derivative 
(TAM-R)® and BT-474 cells that are ER* and contain the ERBB2 
amplification (ER* HER2° ). Similar to BT474 cells, TAM-R cells have 
increased ERBB2 protein levels'’ and both represent cellular systems in 
which increased growth factor signalling results in endocrine resist- 
ance. For all five cell lines, ER ChIP-seq was performed in at least 
duplicate, in asynchronous cells to recapitulate the situation observed 
in primary tumours (Supplementary Fig. 2). 

Almost 7,000 (6,920) ER-binding events were identified in all repli- 
cates of all cell lines (Fig. 3a and Supplementary Fig. 15). Almost all 
(98.9%) of the core ER-binding events that occurred in most primary 
tumours (Fig. 1a) overlapped with the cell-line core ER-binding events. 
DBA identified 8,188 ER-binding events with significantly stronger 
binding affinity in the tamoxifen-resistant cell lines, and 5,713 ER- 
binding events that were stronger in the tamoxifen-responsive cell 
lines (Fig. 3a). Examples of differentially bound regions are shown 
in Fig. 3b. Using the differential ER-binding events, the cell-line 
classification can be visualized by principal component analysis 
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Figure 3 | Identification of a tamoxifen-resistant ER-binding profile. 

a, Heat map representing ER-binding events found in all tamoxifen (Tam)- 
responsive and tamoxifen-resistant cell lines, or those enriched in either 
responsive or resistant cell lines. The window represents +5 kb regions from 
the centre of the binding events. Numbers indicate the total number of ER- 
binding events in each category. b, Examples of ER-binding events that are 
unique to either tamoxifen-responsive or tamoxifen-resistant cell lines. 

c, Principal component analysis (65% of total variance) of differential ER- 
binding events observed in responsive (blue) or resistant (red) breast cancer 
cells. d, Hierarchical clustering of the top 1,500 differentially bound ER-binding 
events (FDR <0.006). e, Enriched motifs within the ER-binding events that 
discriminate between tamoxifen-sensitive and resistant cell lines. 


(Fig. 3c) and hierarchical clustering (Fig. 3d). Enriched motif analysis 
revealed ERE and FOXA1 motifs in regions showing increased ER 
binding in tamoxifen-resistant cell lines (Fig. 3e), which are the same 
motifs observed in the poor outcome ER-binding events in primary 
tumours (Fig. 2c). GATA motifs were enriched in ER-binding events 
depleted during acquisition of drug resistance (Fig. 3e), possibly due to 
competition between FOXA1 and GATA3, another prominent breast 
cancer transcription factor. 

Wehypothesized that the ER-binding events induced in the tamoxifen- 
resistant breast cancer cell lines would be the same regions that were 
enriched in the poor/met clinical samples. However, 79.8% of the 1,192 
ER binding events enriched in the poor/met samples (from Fig. 2b) 
overlap with ER-binding events in wild-type MCF-7 cells, suggesting 
that the cell-line models are closer to the tumours and metastases from 
poor outcome patients. In support of this, the 599 good outcome ER- 
binding events observed in primary tumours (Fig. 2b) overlap poorly 
with the ER binding observed in MCF-7 cells (30.2% versus 79.8% for 
the poor outcome). Interestingly, MCF-7 cells (plus T-47D and ZR75-1 
cells) were derived from the pleural effusion of metastatic breast cancer 
patients, but were established before tamoxifen use in the clinic. We 
propose that MCF-7, ZR75-1 and T-47D cell lines possess an inter- 
mediate ER-binding profile with the acquisition of additional ER- 
binding regions required for resistance to anti-oestrogen treatment. 
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The differences in ER binding between responsive and resistant 
contexts may be due to selection and expansion of a resistant sub- 
population, or may be due to reprogramming of ER binding following 
specific stimuli. Growth factor pathways have long been implicated in 
modulating endocrine response’*"® and have been shown to influence 
ER-binding potential and gene expression profiles". We identified 
various stimuli (EGF, IL-6, TNF-« and IGF-I) shown to induce 
increased cellular invasion and drug resistance and treated asynchron- 
ous MCF-7 cells with control or the cocktail of mitogens for 90 min 
(Fig. 4a). Duplicate ER ChIP-seq replicates were performed (Sup- 
plementary Fig. 2). 

Differential binding analysis identified 6,089 ER-binding regions 
that were differentially enriched (=4 fold change difference, false 
discovery rate (FDR) <0.1) following a 90 min treatment with the 
cocktail. These mitogen-induced ER-binding differences could be 
visualized using principal component analysis (Fig. 4b). Because 
FOXAI is a pioneer factor required for ER-chromatin interactions* 
and FOXAI motifs were enriched in both the mitogen-induced ER- 
binding events (Supplementary Fig. 16) and the tumours from the 
poor outcome patients (Fig. 2c), we assessed whether the rapid, repro- 
grammed ER binding occurred at regions pre-determined by FOXA1. 
We repeated the mitogen treatment, but mapped FOXA1 binding by 
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Figure 4 | ER and FOXA1 binding is dynamic and their expression 
correlates in metastases. a, Morphological changes in MCF-7 cells treated 
with a cocktail of mitogens (EGF, IGF-1, IL-6 and TNF-«) for 90 min. 
Magnification, X40. b, Principal component analysis (93% of total variance) of 
differentially bound ER-binding events can discriminate between the control 
and mitogenic cocktail-treated samples in the first (and second) component. 
c, Heat map showing induction and loss of ER and FOXA1 binding after 90 min 
of treatment with the mitogenic cocktail. Also shown is the normalized average 
signal intensity of all ER- and FOXA1-binding events within the ‘gained’ ER- 
binding events. d, Metastatic samples from different sites were stained by 
immunohistochemistry for ER and FOXA1. Example of ER and FOXA1 
expression in a breast cancer bone metastasis. Magnification, * 40. e, Graphical 
representation of ER and FOXA1 co-expression in metastases. Location of 
metastases is provided, as is the Allred score. 
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ChIP-seq and found that ~25% of the reprogrammed ER-binding 
events (1,515) occur at regions that are already bound by FOXA1, 
before mitogen treatment (Fig. 4c). A substantial proportion (37.6%) 
of the other reprogrammed ER-binding events occur at regions where 
FOXAI binding is also induced by mitogens. As such, ~53% of 
mitogen-induced ER-binding events occur at regions pre-bound by 
FOXAI or at regions that also acquire FOXA1 binding (Fig. 4c), a level 
of concordance that mirrors the ~50% overlap observed in wild-type 
cells®. 

To determine if FOXA1 expression was present in ER distant 
metastases, we obtained 24 metastatic samples (bone, brain and liver) 
from ER™ breast cancer patients and performed immunohistochemistry 
for ER and FOXA1 (Fig. 4d). We found that ~87% of the metastases 
retained ER expression and that FOXA1 expression occurred in ~95% of 
the metastases (Fig. 4e). Importantly, the concordance between ER and 
FOXAI was high (R* = 0.585), regardless of the site of metastasis. 
Therefore, the co-expression of ER and FOXA1 in distant metastases 
supports our conclusions that FOXA1 mediates ER reprogramming. 

By mapping ER binding in clinical samples, we provide a first 
glimpse of the primary regulatory regions that contribute to differ- 
ences within ER* breast cancers, rather than secondary events such as 
gene expression profiles. Our findings suggest that there is plasticity in 
ER binding, with distinct ER-binding profiles associated with clinical 
outcome. These differential ER-binding profiles seem to be mediated 
by FOXA1. A remaining question is what dictates differential FOXA1 
and subsequently ER binding. Possibilities include changes in the 
genomic landscape, alterations in co-factor levels or changes in 
FOXAI structure and function, potentially by post-translational 
modifications. By establishing transcription factor mapping in primary 
samples, we find that differential ER-binding patterns govern gene 
expression programs and are associated with clinical outcome in ER* 
cancer. 


METHODS SUMMARY 


MCE-7, ZR75-1, T-47D and BT-474 human cell lines were obtained from the 
American type culture collection (ATCC) and grown in the relevant media. 
TAM-R cells'* were a gift from I. Hutcheson and R. Nicholson (Cardiff). The 
ER* breast cancer tumours were obtained from the Nottingham Tenovus primary 
breast cancer series, Addenbrooke’s Hospital and Imperial College Healthcare 
NHS Trust, with appropriate ethical approval from the repositories. The malig- 
nant pericardial effusion and the two distant metastases were obtained from 
Imperial College Healthcare NHS Trust. For ChIP in the tumours and metastases, 
the frozen sample was cut into smaller pieces before ChIP, which was then per- 
formed as previously described’. For the malignant pericardial effusion, epithelial 
cells were first enriched using Dynabeads conjugated with Epcam’’. For ChIPs 
from cell-line material, proliferating cells were cross-linked and processed for 
ChIP as previously described’. The antibodies used were anti-ER (sc-543) from 
Santa Cruz Biotechnologies and anti-FOXA1 (ab5089) from Abcam. Sequences 
generated by the Illumina Genome Analyzer were processed by the Illumina 
analysis pipeline version 1.6.1,andaligned to the Human Reference Genome (assembly 
hg18, NCBI build 36.1, March 2008) using BWA version 0.5.5 (ref. 18). Differential 
binding analysis was performed using the DiffBind package’. For immunohisto- 
chemical analyses, ER staining was conducted using the 6F11/2 mouse monoclonal 
antibody (Novocastra, Leica Microsystems) and FOXA1 staining was conducted 
using a rabbit polyclonal antibody (ab23738) from Abcam. An Allred scoring 
system was used to assess staining accounting for both staining intensity and the 
proportion of cells stained. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culture. MCF-7, ZR75-1, T-47D and BT-474 human cell lines were obtained 
from the American type culture collection (ATCC). MCF-7 cells were grown in 
DMEM containing 10% heat-inactivated FBS, 2mM 1-glutamine, 50U ml’ 
penicillin and 50g ml’ streptomycin. ZR75-1, T-47D and BT-474 cells were 
grown in RPMI containing 10% heat-inactivated FBS, 2mM 1-glutamine, 
50U ml ' penicillin and 50,1gml~' streptomycin. To validate the cell lines, 
samples were genotyped by the Health Protection Agency (http://www.hpa. 
org.uk) (MCF-7, ZR75-1, T-47D) or by in-house genotyping (BT-474). TAM-R 
cells'® were obtained from I. Hutcheson and R. Nicholson (Cardiff) and were 
maintained in phenol-red-free DMEM containing 5% charcoal dextran-treated 
FBS, 2mM L-glutamine, 50 U ml! penicillin and 50 pg ml! streptomycin and 
10nM 4-hydroxytamoxifen. 

Primary tumour material. The ER breast cancer tumours were obtained from 
the Nottingham Tenovus primary breast cancer series, Addenbrooke’s Hospital 
and Imperial College Healthcare NHS Trust, with appropriate ethical approval 
from the repositories. The malignant pericardial effusion and the two distant 
metastases were obtained from Imperial College Healthcare NHS Trust. See 
Supplementary Fig. 2 for clinical details. 

Chromatin immunoprecipitations. The antibodies used were anti-ER (sc-543) 
from Santa Cruz Biotechnologies and anti-FOXA1 (ab5089) from Abcam. For 
ChIP in the tumours and metastases, the frozen sample was cut into smaller pieces 
and thawed in 1% (final concentration) formaldehyde for 20 min at room tem- 
perature (20 °C). The reaction was quenched by adding 0.1 volume of 2 M glycine 
for 10 min. The sample was disaggregated by Dounce homogenization and pro- 
cessed according to standard ChIP procedures'®. The DNA was subsequently 
amplified as previously described'®. For the malignant pericardial effusion, 
epithelial cells were first enriched using Dynabeads conjugated with Epcam”. 
For ChIPs from cell-line material, proliferating cells were cross-linked and pro- 
cessed for ChIP as previously described'*. For the TAM-R cells, ER ChIP-seq was 
performed on cells grown in DMEM containing 10% FBS and 10 nM tamoxifen 
for 24h. For the cocktail experiments, cells were treated with 100 ng ml | IGF-1, 
100ng ml’ EGF, 1ngml* TNF- and 10ng ml? IL-6 for 90 min. 
High-throughput sequencing and enrichment analysis. Sequences generated by 
the Illumina Genome Analyzer were processed by the Illumina analysis pipeline 
version 1.6.1, and aligned to the Human Reference Genome (assembly hg18, NCBI 
build 36.1, March 2008) using BWA version 0.5.5 (ref. 18). Reads were filtered by 
removing those with a BWA alignment quality score less than 15. Enriched regions 
of the genome were identified by comparing the ChIP samples to input samples 
using the MACS peak caller version 1.3.7.1 (ref. 10). Additional peak calls were 
determined for the tumour samples using the SWEMBL peak caller version 3.2 (S. 
Wilder et al., manuscript in preparation) with default parameters except with 
—R = 0.005. For tumours without a corresponding input, all the available tumour 
input reads were combined and down sampled to derive a control track for peak 
calling purposes. 

Differential binding analysis. After identifying a common set of peaks (see 
Supplementary Figs 4A and 15), significantly differentially bound sites were iden- 
tified by first counting the number of reads in the ChIP samples overlapping each 
identified peak (subtracting the number of overlapping reads from the corres- 
ponding input sample). Differential analysis was performed using the edgeR 
package”®. Normalization factors were computed using the TMM technique, after 
which tagwise dispersions were calculated and subjected to an exact test”. 
Resulting P values were subjected to Benjamini-Hochberg multiple testing cor- 
rection to derive FDRs; only sites differentially bound with a FDR < 0.1 (cell line 
samples) or a P value <0.01 were considered for further analysis. The differential 
binding analysis and related plots were performed using the DiffBind package”’. 
Heat-map generation. To generate the heat maps of the raw ChIP-seq data, ER- 
binding peaks in the different samples were used as targets to centre each window. 
Each window was divided into 100 bins of 100 bp in size. An enrichment value was 
assigned to each bin, counting the number of sequencing reads in that bin normal- 
ized to 10,000,000 sequencing reads, after subtracting the number of normalized 
input reads per bin. 

Motif analysis. For analysis of enriched DNA motifs, Weeder was used, with the 
default settings”. 

Distant-metastasis-free-survival analysis. First, we computed for each gene in 
the given ER-binding list (ER-core-binding events, good outcome ER binding, 
poor/met ER binding) a P value and statistic from a univariate Cox regression, 
individually in nine independent data sets'**”-”*. This was done to ascertain if the 


observed directionalities of association between gene expression and outcome 
were consistent with the predictions from the differential ER-binding events. 
Histograms of P values over all cohorts clearly confirmed the association of the 
ER-binding events with distant-metastasis-free survival (DMFS; Supplementary 
Fig. 9). Moreover, genes more bound in poor prognosis and metastatic tumours 
were generally more highly expressed in poor prognosis tumours (Supplementary 
Fig. 10). 

We adopted a two-step strategy to first learn from the pool of genes in a given 
ER-binding list, a subset with patterns of gene expression that are associated with 
DMES in a training set, and second to then validate this predictor in completely 
independent data. We used the ER™ breast cancer samples from two independent 
data sets as training sets**”*, re-normalized the gene expression profiles to mean 
zero and unit variance and applied a singular value decomposition to the resulting 
reduced expression matrices. For each of the top 10 eigenvectors (principal com- 
ponents) we correlated their profiles across samples to DMFS using a Cox regres- 
sion, and next identified DMFS-associated eigenvectors, which were also highly 
correlated (or anticorrelated) between the two training sets. Thus, our algorithm is 
similar to the supervised principal components algorithm (SPCA) previously 
described*°, but differs in that we extend this procedure to two training sets. We 
consistently identified DMFS-associated eigenvectors that were also highly corre- 
lated between the two training sets. A prognostic predictor was constructed by 
averaging the DMFS-associated eigenvectors from each training set. The predictor 
was subsequently tested in seven independent ER™ breast cancer cohorts'?”*”°. 
Specifically, for each sample we computed the Pearson correlation of the predictor 
with the sample expression profile over the genes that could be mapped. The 
resulting correlation can be viewed as the risk index. In each cohort this risk index 
was then evaluated in a Cox regression to test ability to predict DMFS. The data 
shown in Fig. le and Fig. 2d is derived from the Loi data set'’, but the additional 
data sets are provided in Supplementary Fig. 6 and 12. 

The Sweave document describing all analyses can be found at http:// 

¢3315341.r41.cf0.rackcdn.com/Data.tar.gz. 
Immunohistochemisty. Immunohistochemistry of full-face sections was per- 
formed using a BondMax Autoimmunostainer (Leica Microsystems). ER staining 
was conducted using the 6F11/2 mouse monoclonal antibody (Novocastra, Leica 
Microsystems) at 1 in 70 and antigen retrieval was performed by heating with pH 6 
citrate buffer for 30 min. FOXA1 staining was conducted using a rabbit polyclonal 
antibody (ab23738) from Abcam, at 1 in 800 with antigen retrieval performed 
by heating with pH 6 citrate buffer for 20 min. Bound primary antibody was 
detected using a BOND polymer detection kit (Leica) and developed with 3-3'- 
diaminobenzidine (DAB). The degree of staining was assessed by a pathologist 
(H.R.A.) using a standard light microscope. An Allred scoring system was used to 
assess staining accounting for both staining intensity (0 = none, 1 = weak, 
2 = moderate, 3 = strong) and the proportion of cells stained (0 = 0%, 1< 1%, 
2 = 1-10%, 3 = 11-33%, 4 = 34-66%, 5 > 66%), providing a composite score 
(intensity + proportion = 0-8). 
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Natural killer cells act as rheostats modulating 


antiviral T cells 


Stephen N. Waggoner’, Markus Cornberg”, Liisa K. Selin' & Raymond M. Welsh! 


Antiviral T cells are thought to regulate whether hepatitis C virus 
(HCV) and human immunodeficiency virus (HIV) infections result 
in viral control, asymptomatic persistence or severe disease, although 
the reasons for these different outcomes remain unclear. Recent gen- 
etic evidence, however, has indicated a correlation between certain 
natural killer (NK)-cell receptors and progression of both HIV and 
HCV infection’ *, implying that NK cells have a role in these T-cell- 
associated diseases. Although direct NK-cell-mediated lysis of virus- 
infected cells may contribute to antiviral defence during some virus 
infections—especially murine cytomegalovirus (MCMV) infections 
in mice and perhaps HIV in humans**—NK cells have also been 
suspected of having immunoregulatory functions. For instance, 
NK cells may indirectly regulate T-cell responses by lysing 
MCMV-infected antigen-presenting cells®’. In contrast to MCMV, 
lymphocytic choriomeningitis virus (LCMV) infection in mice 
seems to be resistant to any direct antiviral effects of NK cells**. 
Here we examine the roles of NK cells in regulating T-cell-dependent 
viral persistence and immunopathology in mice infected with 
LCMV, an established model for HIV and HCV infections in 
humans. We describe a three-way interaction, whereby activated 
NK cells cytolytically eliminate activated CD4 T cells that affect 
CD8 T-cell function and exhaustion. At high virus doses, NK cells 
prevented fatal pathology while enabling T-cell exhaustion and viral 
persistence, but at medium doses NK cells paradoxically facilitated 
lethal T-cell-mediated pathology. Thus, NK cells can act as rheostats, 
regulating CD4 T-cell-mediated support for the antiviral CD8 T cells 
that control viral pathogenesis and persistence. 

Intravenous (i.v.) inoculation of C57BL/6 mice with a low (5 X 10* 
plaque-forming units (p.fu.), medium (2X 10°p.f.u.), or high 
(2X 10° p-fu.) dose of LCMV, strain clone 13, resulted in different 
degrees of pathology, as indicated by weight loss (Fig. la) and by 
histological analysis of lung sections at day 15 post-infection (p.i.) 
(Fig. 1b). The high dose caused a precipitous drop in body weight 
during the first week of infection (Fig. 1a, right) but, thereafter, clonal 
exhaustion and deletion of LCMV-specific T cells resulted in a per- 
sistent infection”"® associated with minimal lung pathology (Fig. 1b, 
right) and 100% (77 of 77) survival (Fig. 1c, top). Selective depletion of 
NK cells using 25 ug of anti-NK1.1 monoclonal antibodies (Sup- 
plementary Fig. 1) 1 day before high-dose infection resulted in 58% 
(35 of 65) mortality between days 9 and 13 of infection (Fig. 1c, top) 
associated with severe pulmonary oedema (data not shown) and 
reduced viral titres by day 7 p.i. (Fig. 1d, right). Under these high-dose 
conditions, therefore, the presence of NK cells promoted persistence 
and prevented mortality. 

In contrast to the beneficial role of NK cells during high-dose infec- 
tion, NK-cell depletion prevented the severe weight loss (Fig. 1a, 
middle) and tissue pathology (Fig. 1b, middle) associated with the 
medium dose of LCMV. Twenty-three per cent (7 of 31) of control- 
treated mice succumbed to the medium dose during the second week 
of infection, and the lungs of surviving mice exhibited bronchus- 
associated lymphoid tissue, pulmonary oedema and _ interstitial 


mononuclear infiltration. Lung pathology was absent in NK-cell- 
depleted mice, which uniformly survived medium-dose challenge 
(Fig. 1c, bottom). Moreover, although high levels of replicating virus 
persisted in surviving control mice at day 15 p.i, NK-cell depletion 
resulted in complete viral clearance (Fig. 1d, middle). In this case the 
presence of NK cells was detrimental for the host, as they promoted 
immune pathology and death. 

Irrespective of the presence of NK cells, inoculation with a low dose 
of virus was uniformly non-lethal in 18 of 18 (100%) control and 18 of 
18 (100%) of NK-cell-depleted mice by >50 days p.i., with minimal 
weight loss (Fig. la, left) and minimal lung pathology (Fig. 1b, left). 
Virus was completely cleared in both groups of mice by day 15 of low- 
dose infection (data not shown), but NK-cell depletion resulted in 
more rapid elimination of LCMV in liver by day 7 p.i. (Fig. 1d, left). 

The weight loss, lung pathology and mortality observed in medium- 
dose-infected wild-type mice (Fig. 1a, b) did not occur after infection of 
of T-cell-receptor-deficient (Tcrb ‘~ )mice, and NK-cell depletion of 
Tcrb-’~ mice did not alter weight loss or viral burden (Supplementary 
Fig. 2). Thus, NK cells regulate viral clearance and immunopathology 
during LCMV infection through a T-cell-dependent mechanism. 

As early as day 6 after medium-dose infection, the proportion and 
number of interferon-y (IFN-y)* LCMV-specific CD8 T cells was 
increased two- to sixfold in mice depleted of NK cells (Fig. 2a and 
Supplementary Fig. 3), and antiviral T cells from these mice showed an 
enhanced ability to co-produce tumour necrosis factor (TNF) 
(Supplementary Fig. 3). The number of LCMV epitope NP396_404 
tetramer-binding CD8 T cells in the spleen on day 5 p.i. was increased 
4- to 20-fold in NK-cell-depleted mice relative to non-depleted control 
mice after infection with all doses of virus (Fig. 2b). The number of 
virus-specific IFN-y* CD4 T cells was also amplified 7- to 20-fold by 
NK-cell depletion compared to control mice on different days after 
medium-dose infection (Fig. 2c). Moreover, co-production of TNF 
and interleukin-2 (IL-2) by antiviral CD4 T cells was augmented by 
NK-cell depletion (Fig. 2d and Supplementary Fig. 3). The increased 
magnitude of the LCMV-specific T-cell response in the absence of NK 
cells during medium-dose infection correlated with rapid viral clear- 
ance (Fig. 2e). Depletion of NK cells using a carefully titrated dose of 
anti-asialo GM1 antibody, which eliminates NK cells but not CD8 T 
cells'’, also enhanced antiviral CD4 and CD8 T-cell responses during 
medium-dose infection (Supplementary Fig. 4). 

The enhanced antiviral T-cell responses suggested that NK-cell 
depletion may augment proliferation of LCMV-specific T cells. 
Transfer of carboxyfluorescein diacetate succinimidyl ester (CFSE)- 
labelled Thy1.1* T cells revealed a larger population of CFSE” donor 
CD4 (Fig. 2f) and CD8 (data not shown) T cells in multiple host tissues 
at day 6 p.i. of high-dose infection in the absence of NK cells. There was 
also greater specific lysis of viral-peptide-coated target cells as detected 
by a conventional in vivo cytotoxicity assay at day 4 of infection (Sup- 
plementary Fig. 5). Moreover, LCMV-specific Ly5.1* TCR transgenic 
(P14) CD8 T cells (transfer 10°) were recovered from tissues of NK- 
cell-depleted recipient (Ly5.2") mice at two- to ninefold greater 
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Figure 1 | NK cells influence T-cell-dependent 


a A : 
© sia Low dose Fito Medium dose et 2 iag High dose pathology and viral persistence during LCMV 
4 r = LH = infection. a-d, C57BL/6 mice treated with IgG2a 
2 100 100 2 100 (Control) or anti-NK1.1 (ANK) were infected with 
S - 2 9 = a low (5 X 104 p.f.u.), medium (2 X 10° p.f.u.), or 
s s 3 high (2 X 10° p.f.u.) doses of LCMV. a, Weight loss 
2 80) ~ rea 2 80) re 2 80) i (mean + s.e.m.) during infection (n = 3-43 per 
= al = 70 = 70 group per day), *P < 0.05, **P < 0.01. 
= 0 2 4 6 8 19 12 14 F£ 0 2 4 6 8 10 12 14 £ 0 4 6 8 ge 12 14 b, Haematoxylin & eosin (H&E) staining of lung 
Days p.i. Days p.i. Days p.i (X400) at day 15 p.i. c, Survival after high- or 
b Low dose Medium dose High dose ¢ medium-dose infection. d, Viral titres 
= (mean + s.e.m.) after low (day 7, n = 3 per group), 
BS . _ : 
3 = 100 Control medium (day 15, n = 9-15 per group), or high (day 
= B 75 (n =77) 8, n = 3 per group) dose infection. Dotted line 
8 ae) represents limit of detection. 
®& 50 
cs 
© 25 ANK 
2 |P<0.0001 (n = 60) 
x 3S 0 ; : 1 
4 ”n 0 10 20 30 
? Days p.i. 
> ANK 
d P=0.016 Wi Control jj ANK P=0.013 = 0 (n =31) 
0) = 
5 et P <0.0001 8 8 
a GB 6 an a TD 75 eT 
Za D> 2 Ss Control 
5 2 6 2 50 (n =31) 
3 3 3 9 
= 3 = 4 ian Ee 
3 a 3 £ 25 
S 2M 2° = S | P=0.006 
ES) o 2+ -- -- - 3 = 0 r r r 1 
1 o 1 2 G 0 5 10 15 20 
Day 7 Spleen Liver Kidney Day 8 Days p.i 
Low dose Day 15 medium dose High dose 


numbers than control-treated mice 6 days after low-dose infection 
(Supplementary Fig. 5). Together these results indicate that NK1.1* 
cells repress the size of the antiviral T-cell response during LCMV 
infection. 

The activities of CD4 T cells are important for maintaining CD8 
T-cell function during LCMV infection'*""*. To assess whether CD4 T 
cells were involved in the NK-cell suppression of LCMV-specific CD8 
T cells, mice were treated with antibodies to concurrently deplete both 
NK and CD4 T cells. Whereas depletion of NK cells before medium- 
dose LCMV infection resulted in a >200-fold reduction in splenic viral 
titres at day 7 p.i. relative to control and CD4-depleted (ACD4) mice 
(Fig. 3a), depletion of both NK and CD4 T cells (ANKACD4) had no 
effect on viral titres. The increase in the number of antiviral CD8 T cells 
producing more than one cytokine caused by NK-cell depletion was 
also prevented by co-depletion of CD4 T cells (Fig. 3b). In contrast, co- 
depletion of NK and CD8T cells did not prevent an increase in IFN-y* 
LCMV epitope GP¢1-so-specific CD4 T cells (control: 3.8 + 0.5% 
versus ANK: 9.8 + 0.7% versus ANK/ACD8: 10.7 + 1.6%, n= 3, 
P<0.05 versus control) at day 12 of medium-dose infection. 

Paradoxically, at the high virus dose, co-depletion of NK and CD4 T 
cells prevented the severe pulmonary oedema (Fig. 3c) and increased 
mortality (Fig. 3d) associated with depletion of NK cells alone. In this 
experiment, mice were harvested on day 12 p.i., when three surviving 
NkK-cell-depleted mice were moribund and required euthanasia, 
whereas all double-depleted mice showed relatively normal vigour. 
The livers of NK/CD4 double-depleted mice contained 25-fold more 
p-fu. than livers from mice depleted of NK cells alone (NK: 5.7 + 0.2 
p-f.u. versus ANK/ACD4: 7.1 + 0.1 p.f.u.,n = 5, P< 0.0001). Enhance- 
ment of LCMV-specific CD8 T cells in the absence of NK cells was 
also abrogated by concurrent depletion of CD4 T cells (Supplementary 
Fig. 6). Together these data indicate that CD4 T cells are needed for 
NkK-cell modulation of antiviral CD8 T-cell responses associated with 
viral clearance, persistence and immunopathology. 

We used a modified in vivo cytotoxicity assay by injecting 
splenocytes from medium-dose-infected NK-cell-depleted mice 


(Ly5.1°, day 4 p.i.) into medium-dose-infected NK-cell-depleted 
(ANK) or isotype IgG2a-treated (control) recipient mice (Ly5.2", 
day 3 p.i.). After 5h, similar proportions of total donor T (control: 
0.16 + 0.03% versus ANK: 0.15 + 0.02%, n = 21, P = 0.80) and B cells 
(control: 1.8 + 0.2% versus ANK: 1.7 + 0.2%, n = 21, P = 0.88) were 
recovered from infected recipients, regardless of NK-cell depletion. 
Likewise, recovery of activated (CD44™ CD43(1B11)*) donor CD8 
T cells was similar from spleens of control and ANK mice, with min- 
imal loss relative to uninfected control mice (Fig. 4a). In contrast, there 
was a substantial loss of activated donor CD4 T cells in infected relative 
to uninfected recipients, and this loss was prevented by depletion of 
NK cells (Fig. 4a). The magnitude of NK-cell-dependent loss of 
activated donor CD4 T cells was similar in low-, medium- and high- 
dose-infected recipients (Fig. 4b). More activated CD4 T cells, both 
donor and host derived, in infected (control) mice stained positively 
for the apoptosis indicator annexin V in comparison to naive donor 
CD4 T cells or to activated donor CD4 T cells in medium-dose- 
infected ANK recipient mice (Fig. 4c). In contrast to activated donor 
CD4 T cells, the recoveries of naive (CD44'°”) phenotype CD4 and 
CD8 donor T cells were not altered by NK-cell depletion (data not 
shown). These data indicate that NK cells in wild-type mice selectively 
and rapidly target activated CD4 T cells for elimination during LCMV 
infection. 

We next examined the involvement of NK-cell cytolytic mediators 
Fasl, TNF and perforin (Prfl) in this process. The loss of activated 
wild-type donor CD4 T cells in infected wild-type recipient mice 
(Fig. 4a, d) was seen when activated Fas" (Fas mutant) mouse donor 
cells were transferred into wild-type recipient mice or when wild-type 
donor cells were transferred into Tnf~ recipient mice (Fig. 4d). In 
contrast, there was relatively little loss of activated wild-type donor 
CD4 T cells in Prfl' ~ hosts (Fig. 4d), whose retention of activated 
donor CD4 cells was not significantly different (P > 0.1) from that in 
NK-cell-depleted wild-type or Prfl~’~ hosts. Thus, NK-cell elimina- 
tion of activated CD4 T cells is mediated through a perforin-dependent 
pathway that does not require Fas or TNF. 
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Figure 2 | LCMV-specific T-cell responses enhanced in NK-cell-depleted 
mice. a—c, Number (mean + s.e.m.) of LCMV-specific T cells measured by 
IFN-y expression (a, c) or tetramer-binding (b) at various days p.i. 

(a, c) (medium dose, n = 3 per group per day) or at day 5 p.i. (b) (various doses, 
n = 3-11 per group). d, Co-production of TNF and IL-2 by gated IFN-y* CD4 
T cells (day 8 p.i., medium dose) after in vitro stimulation with GP61 peptide. 
e, Viral titres in liver (n = 3-11 per group per day, medium dose). f, CFSE 
dilution by donor (Thyl.1*) CD4 T cells in uninfected (Uninf.) and infected 
(medium dose, day 6 p.i.) control and ANK Thy1.2* host mice. Control versus 
ANK mice, *P < 0.05, **P < 0.01. 


Previous work has implicated NKG2D (also known as Kirk1) in 
targeting of activated T cells by murine NK cells in vitro’, but we 
observed no differences in activated wild-type donor CD4 T-cell survival 
in wild-type versus Nkg2d ‘~ recipients (Fig. 4d) or in wild-type mice 
treated with a blocking monoclonal antibody to NKG2D** (data not 
shown). Of note is that we did not observe the expression of ligands for 
activating NK-cell receptors including NKG2D, NKp46 (also known as 
NCR1), DNAM-1 (also known as CD226) and TRAIL (also known as 
TNFSF10) on these early activated CD4 T cells, and NK-cell-mediated 
elimination of activated donor CD4 T cells also occurred in antibody- 
deficient (uMT ‘~) mice (data not shown), precluding a role for anti- 
body-dependent mechanisms. Activated CD4 T cells did, however, 
express much higher levels of adhesion molecules than naive cells, 
and these molecules have previously been shown to trigger NK-cell 
cytotoxicity via LFA-1'’”°. Somewhat surprising was the observation 
that the activated CD4 T cells were far more susceptible than activated 
CD8 T cells to direct killing by the NK cells, even though both 
expressed high levels of adhesion molecules. We previously had shown 
that the presence of the negatively signalling receptor CD244 (2B4) on 
NK cells prevented NK-cell-mediated lysis of activated CD8 T cells”. 
We found here that although expression of the CD244 ligand CD48 
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Figure 3 | Role of CD4 T cells in NK suppression of antiviral CD8 T cells, 
viral control and immunopathology. a-e, Prior to medium- (a, b) or high- 
dose (c-e) infection, mice were injected with isotype (Con, control), anti-NK1.1 
(ANK), anti-CD4 (ACD4), or both (ANKACD4). a, b, Viral titres in spleen 
(a) and number of cytokine-producing NP396_404-specific CD8 T cells (n = 5 
per group, + s.e.m.) in spleen at day 7 of medium-dose infection (b). ¢, d, H&E 
staining of lung (X100) (c) and survival at day 12 of high-dose infection 

(n = 3-5 per group) (d). 


was upregulated on T cells after medium-dose LCMV infection, 
expression levels of CD48 were much higher on activated CD8 than 
on activated CD4 cells (mean fluorescence intensity (MFI) of CD48: 
activated CD4, 3,423 +147; activated CD8, 6,180+ 166; n=9, 
P<0.0001) (Supplementary Fig. 7). 

To assess whether NK-cell-mediated lysis of activated CD4 T cells is 
a general principle of virus infections, we examined the loss of LCMV- 
activated CD4 T cells after transfer into mice inoculated with an un- 
related Arenavirus, Pichinde virus (PV), the Coronavirus mouse 
hepatitis virus (MHV), or the interferon inducer and NK-cell activator 
polyinosinic:polycytidylic acid (polylI:C). All three stimuli induced 
measureable loss of activated donor CD4 T cells that was dependent 
upon the presence of NK cells (Fig. 4e). In reciprocal experiments, CD4 
T cells activated during infection with PV, MHV, vaccinia virus (VV) 
or MCMV were lost upon transfer into mice infected with medium- 
dose LCMV when NK cells were present (Fig. 4f). 

An analysis, by in vivo cytotoxicity assays after transfer into NK- 
cell-sufficient mice, of the window of time during which NK-cell regu- 
lation of T cells occurred in the LCMV medium-dose model showed 
reduced frequencies of activated donor CD4 T cells relative to unin- 
fected recipients 1 (43%), 2 (32%), 3 (26%), 4 (4%) and 5 (8%) days 
after infection (Supplementary Fig. 8). The frequencies of activated 
donor CD4 T cells were increased by NK-cell depletion in recipient 
mice only at day 2 and day 3 p.i. These results indicate that NK cells 
target activated CD4 T cells mainly on the second and third day of 
infection, when the cytolytic activity of NK cells is at its peak’. 

These results show that NK cells can have a crucial role in control- 
ling virus-associated morbidity, mortality and persistence in the 
absence of direct NK-cell-mediated control of virus replication, and 
they do so by altering the numbers and polyfunctionality of virus- 
specific T cells. Their effect on activated CD4 T cells was presumably 
due to direct cytotoxicity, as demonstrated by rapid perforin-dependent 
elimination of activated CD4 T cells by NK cells in short-term in vivo 
cytotoxicity assays and the observation of enhanced annexin reactivity 
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Figure 4 | NK cells rapidly eliminate activated CD4 T cells. a-f, In vivo 
cytotoxicity assays were performed as described in Methods using donor cells 
from NK-cell-depleted wild-type (Ly5.1 *) (a-f) or Fas!" (Ipr) (d) mice 4 days 
p.i. with medium-dose LCMV (a-e) or other viruses (f). Recovery of donor T 
cells was examined 5h after transfer into control or NK-depleted (ANK) 
Ly5.2* mice (a, n = 21-28 per group; b-f, n = 3-5 per group). Host mice were 
uninfected or inoculated with medium-dose LCMV (day 3 p.i.) 

(a, ¢, d, f), various doses of LCMV (day 3 p.i.) (b), or PV (day 2), MHV (day 3) 
and polyI:C (pI:C) (day 1) (e). a, b, d-f, Lysis (mean + s.e.m.) of donor T cells in 
infected relative to uninfected recipient mice; control versus ANK mice, 

*P <0.05. WT, wild type. c, Annexin V reactivity of donor (Ly5.1*) and host 
(Ly5.1) CD4 T cells. g, Proposed model connecting NK-cell killing of CD4 T 
cells to CD8 T cells and infection outcome in the presence of NK cells. 


of CD4 T cells in the presence of NK cells. The effect of activated NK 
cells on CD8 T cells and virus clearance seemed to be indirect, and 
depended on the presence of CD4 T cells, which are known to produce 
factors that preserve CD8 T-cell viability and functionality’***. 
This suggests a three-way interaction, whereby NK cells suppress the 
CD4 T-cell response, thereby preventing augmentation of the CD8 
T-cell response, which, in turn, directly regulates viral clearance and 
immunopathology in this system (Fig. 4f). 

Our observation of direct NK-cell-mediated lysis of T cells during 
virus infection is distinct from published accounts of NK-cell regu- 
lation of antiviral T cells during MCMV infection, in which NK- 
cell-mediated lysis of virus-infected cells contributes to control of 
viral burden and persistence of MCMV-infected dendritic cells that 
in turn regulate activity of antiviral T cells*”’°. Moreover, we found no 
NK-cell-dependent changes in the number and antigen-presenting 
function of splenic dendritic cells during LCMV infection (Sup- 
plementary Fig. 9), consistent with our finding that NK cells directly 
regulated the T cells. A possible concern was that in vivo cellular 
depletion with antibodies against NK1.1 and CD4 have the potential 
to affect frequencies of NKT, yo T and regulatory T cells, but the 
frequencies of these lymphocytes in the spleen of LCMV-infected mice 
were not altered after 3 to 4 days of infection by the low concentration 
of anti-NK1.1 used in these studies (Supplementary Figs 1 and 10). 
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Moreover, depletion of NK cells in y6 T-cell-deficient (Terd~/~) and 
NK T-cell-deficient (Cdld ’) mice during medium-dose infection 
enhanced LCMV-specific T-cell responses and reduced viral loads 
(Supplementary Fig. 9), similar to that in wild-type mice. Therefore, 
these lymphocyte lineages seem to be dispensable for NK-cell immuno- 
regulatory function during LCMV infection. 

By adjusting the dose of the LCMV inocula, it is possible to generate 
diverse patterns of CD8 T-cell-regulated pathogenesis, similar to the 
variety of pathogenic patterns a human HCV infection can take, 
including rapid viral clearance, severe T-cell-dependent immuno- 
pathology and long-term persistence. We show here that at a high dose 
of LCMV, NK cells act beneficially by suppressing T-cell responses, 
thereby preventing severe pathology and mortality while enabling the 
development of a persistent infection from which mice eventually 
recover and clear the virus”’. At the medium-dose inoculum, NK-cell 
suppression of T cells is detrimental to the host, as virus clearance is 
impaired due to the limited number and functionality of T cells. 
However, a medium dose of virus is not sufficient for complete clonal 
exhaustion of T cells, ultimately resulting in severe T-cell-dependent 
immunopathology that can lead to death of the host. 

These results indicate that NK cells can serve as rheostats, or master 
regulators, of antiviral T-cell responses. Consistent with the fact that 
many virus infections induce cytokines that potently activate NK 
cells*®, we found that NK-cell lysis of activated CD4 T cells was 
triggered by several viruses as well as after inoculation with polyI:C, 
which induces interferon and activates NK cells. Although a previous 
study found that NK-cell depletion did not alter the magnitude of 
antiviral T-cell responses during infection with the Armstrong strain 
of LCMV”, we have observed enhanced antiviral T-cell responses and 
improved viral control at early time points after infection of NK-cell- 
depleted mice with both LCMV Armstrong and Pichinde virus 
(S.N.W., unpublished observations). Thus, the timing and the type 
of evaluation may be important to detect detrimental effects of NK 
cells on T cells during more benign viral infections. 


METHODS SUMMARY 

Infection model. One day before infection, male C57BL/6 mice were selectively 
depleted of NK cells through a single intraperitoneal (i-p.) injection of 25 pg anti- 
NK1.1 monoclonal antibody (PK136) or a control mouse IgG2a (both from Bio-X- 
Cell), as previously described”! (Supplementary Fig. 1). In some cases, mice were 
also depleted of CD4 T cells by ip. injection of 100 yg anti-CD4 (GK1.5) at days 
—1 and +3 of infection. Mice were then infected i.v. with 5 X 10* (low dose), 
2X 10° (medium dose) or 2 X 10° (high dose) p.fu. of the clone 13 variant of 
LCMV. Virus was titrated by plaque assay on Vero cells. In some experiments, 
mice were inoculated i.p. with 1.5 X 10” p.f.u. of PV, 8 X 10° p.f.u. of MHV strain 
A59, 1 X 10° p.f.u. of VV strain Western Reserve, 1 X 10° p.f.u. of Smith strain 
MCMV, or 200 ig of polyl:C (Invivogen). 

Immune assays. The number of LCMV-specific T cells was measured by H-2D>- 
NP3o¢-404 tetramer staining or by intracellular cytokine staining after 5h ex vivo 
stimulation with 1 1M viral peptide in the presence of brefeldin A. T-cell cytolytic 
activity was measured in vivo as described previously”. 

In vivo NK-cell assay. An unconventional in vivo cytotoxicity assay was previ- 
ously established to determine NK-cell killing of lymphocyte populations”. Donor 
mice were depleted of NK cells and then infected iv. or ip. with different viruses. 
At day 4 p.i., single-cell splenocyte suspensions were prepared from these mice, 
labelled with 2 uM CFSE, and transferred (2 X 10) into various strains of NK-cell- 
depleted or control recipient mice that were either uninfected or had been infected 
with virus 1-5 days previously. Spleens of recipient mice were harvested 5h after 
transfer and assessed for survival of donor T cells. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 
Mice. C57BL/6, Thyl.1*, Tcrd~/—, Tcrb-’~, Fas'", Prft-'~ and uMT/~ mice 
were purchased from The Jackson Laboratories. Ly5.1* mice were from Taconic 
Farms. Nkg2d /~ and Cdld ’~ mice on a C57BL/6 background were obtained 
from B. Polié*® and M. Exley*!, respectively. Congenic (Ly5.1~) TCR transgenic 
P14 (ref. 32) mice and Tnf~ ‘~ mice on a C57BL/6 background were bred at the 
University of Massachusetts Medical School (UMMS). Male mice at 6-16 weeks of 
age were routinely used in experiments. Mice were maintained under specific 
pathogen-free conditions, and experiments were performed in compliance with 
institutional guidelines as approved by the Institutional Animal Care and Use 
Committee of UMMS. 
Virus infections and in vivo cell depletions. The clone 13 variant of LCMV was 
propagated in baby hamster kidney BHK21 cells? and titrated by plaque assay on 
Vero cells. Mice were infected i.v. with 5 X 10* (low dose), 2 X 10° (medium dose), 
or 2 X 10° (high dose) p.f.u. of LCMV. Selective depletion of NK cells was achieved 
through a single i.p. injection of 25 tg anti- NK1.1 monoclonal antibodies (PK136) 
or a control mouse IgG2a produced by Bio-X-Cell, as previously described?! 
(Supplementary Fig. 1). Alternatively, mice received a carefully titrated dose of 
10 pl of anti-asialo GM1 antibody (Wako Biochemicals) diluted in 200 pl PBS i.p. 1 
day before virus infection. Anti- NKG2D monoclonal antibody (CX5) was a gift of 
L. Lanier, and 200 1g was injected i.p. at the time of infection. Mice were depleted 
of T cells by ip. injection of either 100 pg anti-CD4 (GK1.5) or 50 pg anti-CD8 
(2.43) produced by Bio-X-Cell at day —1 and day +3 of infection. In some 
experiments, mice were inoculated i.p. with 1.5 X 10’ p.f.u. of PV, 8 X 10° p.f.u. 
of MHV strain A59, 1 X 10° pfu. of VV strain Western Reserve, 1 X 10° p-f.u. of 
Smith strain MCMV, or 200 ug of polyl:C (Invivogen). 
Tetramers and peptides. T-cell epitopes encoded by LCMV include NP 396-404 
(FQPQNGOQFI), GP33-41 (KAVYNFATC) and GP¢\-s9 (GLKGPDIYKGVYQF 
KSVEFD)****. Peptides were purchased from 21st Century Biochemicals and puri- 
fied by reverse phase-HPLC to 90% purity. H-2D°-NP39¢_494 tetramers were pre- 
pared as described**. CD1d-PBS57-allophycocyanin tetramers provided by NIAID 
Tetramer Facility were a gift from L. Berg. 
Antibodies and FACS analysis. Fluorescently labelled antibodies and reagents 
were purchased from BD Biosciences, eBioscience, BioLegend and R&D 
Biosystems. Flow cytometric analyses of cells were performed on a LSR II 
cytometer (BD Biosciences) equipped with FACSDiva software and data were 
analysed using FlowJo software (Tree Star). 
CFSE labelling and adoptive transfer. Spleens from donor mice were mechanically 
disrupted, and erythrocytes were lysed using a 0.84% NH,Cl solution in order to 
generate single-cell leukocyte suspensions. Cells were labelled for 15 min at 37 °C with 
the 2 11M fluorescent dye CFSE (CFDA-SE, Molecular probes), washed, and trans- 
ferred iv. (3 X 10’ cells) to recipient mice. 
In vivo cytotoxicity assays. T-cell cytolytic activity was measured in vivo as 
described previously". Briefly, single-cell suspensions were prepared from spleens 
of uninfected mice, and separate fractions of cells were then loaded with LCMV 
peptides (1 1M) for 45 min at 37 °C before labelling with CFSE (2.5, 1 or 0.4 1M, 
Molecular Probes) for 15 min at 37°C. After washing, these populations were 
combined at equal ratios and transferred i.v. into either naive or infected recipients. 
Survival of each transferred population in the spleens of recipient mice was 
assessed 16h after transfer. Specific lysis was calculated as follows: 100 — ((% 
LCMV target population in infected experimental/% unlabelled population in 
infected experimental) + (% LCMV target population in naive control/% 
unlabelled population in naive control)) 100). 

An unconventional in vivo cytotoxicity assay was previously established to 
determine NK-cell killing of lymphocyte populations in vivo~'. Wild-type or 
Fas'?" donor mice were depleted of NK cells and then infected with VV, 
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MCMV, MHYV, PV, or a medium dose of LCMV clone 13. At day 4 p.i., single 
cell splenocyte suspensions were prepared from these mice, labelled with CFSE, 
and then transferred (2 X 10’) into experimental recipient mice on day 3 of 
medium dose LCMV infection, unless otherwise noted. Recipients included 
WT, Prfl~! ~, Tnf, or Nkg2d~’ ~ mice that were administered anti-NK1.1 or 
isotype control antibodies one day before inoculation with PV, MHC, polyI:C, or 
various doses of LCMV clone 13. Some recipient mice were uninfected and served 
as controls. Spleens of recipient mice were harvested 5 h after transfer and assessed 
for survival of donor T cells. 

In vitro antigen presentation assay. Stimulator cells were prepared by isolation 
of single-cell suspensions from the spleens of uninfected as well as isotype-treated 
or anti-NK1.1-treated mice infected 3 days previously with a medium dose of 
LCMV iv. Following irradiation, stimulator cells (5 X 10*) were plated at a 1:10 
ratio with CFSE-labelled Ly5.1* LCMV-specific P14 CD8 T cells (5 X 10°) in 
T-cell stimulation medium (RPMI supplemented with 100 U ml’ penicillin G, 
100 pg ml! streptomycin sulphate, 2mM .-glutamine, 10 mM HEPES, 1mM 
sodium pyruvate, 0.1 mM non-essential amino acids, 0.05 mM 2-mercaptoethanol 
and 10% heat-inactivated (56 °C, 30 min) FBS), which was refreshed every 2 days. 
P14 cells were enumerated and analysed for dilution of CFSE as a measure of 
proliferation every 24h after initiation of co-culture. 

Lymphocyte preparation and intracellular cytokine assay. Single-cell leukocyte 
suspensions from spleens, inguinal lymph nodes, lung and liver were prepared as 
described previously”! and were plated at 2 X 10° cells per well in 96-well plates. 
Cells were stimulated for 5 h at 37 °C with either 1 1M viral peptide or 2.5 ug ml! 
anti-CD3 monoclonal antibody in the presence of brefeldin A and 0.2Uml ! 
rhIL-2. Stimulated cells were then pre-incubated with a 1:200 dilution of Fc 
Block (2.4G2) in FACS buffer (HBBS, 2% FCS, 0.1% NaN3) and stained for 
20min at 4°C with various combinations of fluorescently tagged monoclonal 
antibodies. After washing, cells were permeabilized using BD Cytofix/Cytoperm 
solution and then stained in BD Permwash using monoclonal antibodies specific 
for various cytokines. AnnexinV staining was performed in azide-free FACS buffer 
directly ex vivo according to manufacturer’s instructions (BD Biosciences). 
Statistical analysis. Results are routinely displayed as mean = s.e.m., with statistical 
differences between experimental groups determined using a two-tailed unpaired 
Student’s t-test, where a P value of <0.05 was deemed significant. Statistical differ- 
ences in survival were determined by log rank (Mantel-Cox) analysis. Graphs were 
produced and statistical analyses were performed using GraphPad Prism. 
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Field hospitalit 


Hosting guests on research trips can give scientists a chance 
to showcase their work — but it can also cause distractions. 


BY LUCAS LAURSEN 


arly in his career, Paul Olsen sat in front 
EB: a television, expecting to see his own 

image. He had hosted a television crew 
on a research expedition to Manicouagan 
Crater in Canada, where he and his team were 
investigating the Triassic-Jurassic boundary 
in the geological record. Olsen, a palaeontolo- 
gist at Lamont-Doherty Earth Observatory of 
Columbia University in Palisades, New York, 
had spent hours explaining and re-explaining 
for the camera how scientists used the site to 
reconstruct ancient ecologies. As the opening 
credits rolled, Olsen wondered how he would 
come across on the small screen. 

But by the end credits, he was still won- 
dering. All that had appeared on screen was 
a brief flash of Olsen’s disembodied hand, as 
the documentary focused on other research- 
ers from the expedition. In the decades since 


then, Olsen has hosted at least a dozen teams 
of journalists, from National Geographic, the 
BBC and the Canadian Broadcasting Corpora- 
tion among others, each time devoting days to 
assisting them and solving logistical challenges 
on their behalf. He has learned that a journal- 
ist’s gratitude does not always translate into a 
starring role — and nor should it. Although 
some of his colleagues express disappoint- 
ment when their hours of storytelling don't 
make the final cut of a documentary, Olsen 
acknowledges that journalists must be “fairly 
ruthless’, because “they're there to tell a story”. 
Scientist hosts might find that they’re not part 
of that tale. 

Hosting guests such as journalists, policy- 
makers or teachers during research fieldwork 
is a gamble. In exchange for a shot at publiciz- 
ing their work or educating a wide audience, 
scientists must give up an unknowable amount 
of time and accept the risk — and the added 


cost — of taking extra people to a remote loca- 
tion. If it goes well, a visiting teacher might 
help to spread the word, or a policy-maker 
might be able to make more informed deci- 
sions; but scientists may also end up with 
recalcitrant or shivering visitors who hole up 
in their rooms instead of getting close to the 
fieldwork. To make the most of bringing guests 
to the field, hosts should plan ahead, commu- 
nicate their goals, try to head off conflicts and 
expect to exercise patience. 


BE PREPARED 

Hosting may give researchers an opportunity 
to sell their science. “It theoretically helps your 
career, says Kevin Krajick, a media-relations 
manager and science writer at Lamont- 
Doherty, who helps to connect journalists with 
scientists. “It’s a bit of marketing — organiza- 
tions such as the National Science Foundation 
love to see media — and it can help educate the 
general public.’ 

The impact may be more important for gen- 
eral fields of study than for any one researcher's 
work. “I don't think there are too many ben- 
efits directly to the scientist,” says Olsen. Yet 
he continues to host guests owing to a sense 
of responsibility to members of the public, 
whose taxes fund most of his work. Jill Baron, 
an ecologist at the US Geological Survey in 
Fort Collins, Colorado, says that bringing 
policy-makers closer to the wilderness helps to 
persuade them that such places are worth pro- 
tecting. Visits can also establish useful relation- 
ships. “When they need information they are 
likely to call you,’ says Baron, who has hosted 
policy-makers ranging from local air-quality 
commissioners to US senators. 

Communicating science in the field can 
help to further a cause and even, in rare cases, 
catalyse research-vindicating policies. In 2007, 
after decades of visits by Colorado officials to 
Barons field sites, a consortium including the 
US Environmental Protection Agency, Colo- 
rado’s air-quality commission and the US 
National Park Service drafted a plan to reduce 
nitrogen deposition in the Rocky Mountain 
National Park. Baron says that without such 
visits, “it’s hard for people to actually visualize” 
environmental threats. She once spent hours 
explaining to a mayoral staffer from Denver 
how agricultural fertilizer used on Colorado's 
plains infiltrated the Rocky Mountains. After 
walking all day through the Rockies, the staffer 
stopped and asked her: ““You mean what we do 
down in Denver makes a difference up here?” 
Baron recalls. “You can talk until you're blue > 
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Left, palaeontologist Paul Olsen (kneeling) hosting a camera crew in Morocco. Right, Kevin Krajick, a science writer, (left) on a field trip in Bangladesh. 


> but until they see it they won't understand,” 
she says. 

When David McGee, a climatologist at the 
Massachusetts Institute of Technology in Cam- 
bridge, hosted freelance writer Douglas Fox 
on a field trip, he expected to teach Fox about 
the science involved in his work. But McGee 
was surprised by what he learned from the 
experience himself. “It improved my ability 
to talk about what I do,” he says. “You get bet- 
ter at breaking out of the jargon and lingo and 
assumptions about what is important” to out- 
siders (see Nature 468, 465-467; 2010). McGee 
took Fox, who is based in San Francisco, Cali- 
fornia, on a trip to the dry bed of Lake Bonne- 
ville, which once covered much of what is now 
northern Utah. McGee says he was excited that 
someone was interested in his work and helping 
to publicize it. The expedition led to a feature 
story in High Country News, a magazine for the 
western United States (go.nature.com/qiiyqj); 
McGee now shares the article with prospective 
graduate students and postdocs. 

As useful as these outreach lessons were, 
they did cause frustrating and time-consuming 
complications. One of McGee’s Utah field 
sites was on US military property. McGee had 
arranged permits long before the expedition 
— for scientists. But when his military contacts 
discovered that Fox was joining the trip, they 
decided that the team would need a military 
supervisor. They also decided that they were 
willing to send the supervisor for only one 
day. So Fox missed out on some of the expedi- 
tion’s scientific research and McGee got stuck 
with lengthy, last-minute negotiations. “Once 
you've invited this person along and they’ve 
got their plane tickets, you end up with some 
responsibility to help as an intermediary,’ he 
says. “It all worked out in the end, but it was 
a headache” 

McGee also learned that journalists are 
attracted to the little conflicts that scientists 


are less eager to broadcast. With Fox around 
all the time, every prickly exchange among 
the researchers, no matter how routine, was 
potential story material. “Sometimes with col- 
laborators you're short about someone's differ- 
ence of opinion,’ says McGee. “But that’s not 
the thing you want to see in print.” As a result, 
some of his colleagues were not comfortable 
with Fox’s presence, and McGee now knows 
that he needs to check with other members of 
his team before taking a journalist along. 


LESSON PLANNING 

A proactive approach before the field visit 
can help to mitigate misunderstandings, says 
Krajick. “You want to be prepared,” he warns 
scientists before they take visitors on trips. 
“Be ready to explain your stuff and have a talk 
over the phone before going on the expedition 
to make sure you're on the same page.” That 
allows scientists to lay ground rules such as 
when they will be available for questions; and 
it gives journalists a chance to explain when in 
the editorial process they might want to con- 
sult the scientist to check facts. 

Olsen discusses mutual expectations with all 
guests ahead of time. He also prepares hand- 
outs for teachers and journalists, summarizing 
the main points of interest of the field site and 
the objectives of the expedition. Baron and her 
students and technicians separate groups of 
policy-makers into different hiking parties, on 
the basis of their physical abilities. “Don't take 
staffers or policy-makers or senators, especially 
senators, anywhere they’re going to be cold or 
thirsty or get altitude sickness,’ she advises. 

It helps for scientists to have some control 
over who goes into the field with them. “One 
of the things we have learned in the past is that 
the researchers need to be able to pick their 
teacher,” says Janet Warburton, an education 
coordinator at PolarTREC, a programme 
based in Fairbanks, Alaska, that gives school 
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teachers a hands-on research experience in 
the polar regions. PolarTREC filters applicants 
for motivation and the ability to translate their 
experience into lesson plans; researchers them- 
selves must ensure that the teacher will be able 
to cope with the challenges of life and work 
on their field trip. Lack of filtering can cause 
wasted opportunities: in one unfortunate case, 
says Warburton, a teacher not up to the physi- 
cal challenge of the daily walk to a field site 
remained in the barracks instead of participat- 
ing in the fieldwork. 

Mark Goldner, a teacher at Heath School 
in Brookline, Massachusetts, learned about 
research expectations first hand. Goldner was 
shortlisted for an expedition to the Svalbard 
archipelago in the Norwegian Arctic last July. 
Before he could join the trip, he had to pass a 
thorough interview with the principal investiga- 
tor — Julie Brigham-Grette, a palaeoclimatolo- 

gist at the University 


“Be ready to of Massachusetts in 
explain your Amherst. “Julie came 
stuff and make right out and said, 
sure you ’reon ‘Here’s sae things 
» we expect of you — 

ne UNEP UEE: what do you think?” 
Goldner remembers. 


As a trip leader, Brigham-Grette always 
tells potential participants in her field trips 
about the physical demands of excursions, 
such as long walks in the cold, and asks them 
about their ability to handle responsibili- 
ties that include blogging about the trip and 
helping with expedition equipment. She had 
one teacher who wasn't able to operate his 
computer or blog effectively on a trip — and 
because Brigham-Grette considers communi- 
cation an important part of a visiting teacher's 
role, she now asks about those skills. “When 
Tm in the field, I car’t help them navigate that;” 
she says. 

Meeting in advance can mitigate stress and 
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distractions once the trip is under way — in 
the field, it is sometimes hard for scientists to 
explain the big picture of the research effort 
because they are so caught up in the mechan- 
ical details of data collection. “When teachers 
are going out in the field, they're seeing only 
one piece of the process,’ says Warburton. 
“The team is usually very stressed collecting 
data and may not have the energy to explain 
the big picture in the moment.” 

For teacher trips, pre-planning should 
entail some thought about how the experi- 
ence will benefit the classroom. Before his 
own expedition to Siberia with Brigham- 
Grette, Tim Martin, a teacher at Greensboro 
Day School in North Carolina, discussed 
with her how his web design and photogra- 
phy skills could help to translate her science 
into useful lessons for his students. He also 
improved his understanding of her work. 

Since their expedition together, Brigham- 
Grette and Goldner have stayed in touch so 
that she can share her group’s analyses of the 
data that they collected. “The whole goal of 
PolarTREC shouldn't be about me, or my trip 
to the Arctic, it’s really about my students 
and the outreach that I can do,” says Gold- 
ner. “That ongoing collaboration is really 
important.” 


ON THE RECORD 
Allowing guests, especially journalists, to 
participate in expeditions may not be the 
best choice for every expedition. Anything 
from precarious logistics to bad weather or 
sensitive politics might cause scientists to 
postpone, says Olsen. Krajick adds, “If you're 
not prepared to go on the record all the time, 
I don't think you should take someone along.” 
Most fieldwork takes place in too intimate a 
setting to expect much control over gossip or 
frayed nerves. 

Sharing the experience doesn't have to 


mean relinquishing all control. Olsen has 
experimented with one way of dealing with 
the time demanded by television crews: he has 
decided to not bother trying to conduct real 
science for the cameras. Mock observations 
are enough for television journalists, and not 
having to worry about botching precise meas- 
urements makes answering questions easier. 

Indeed, hosting often requires flexibility. 
“You cant plan which days are for discovery 
and which are photo days,” says McGee. He 
acknowledges that Fox’s visit was rather long 
at four days, but McGee liked the result — a 
nuanced depiction of his work. 

Field guests can be an asset, even beyond 
outreach: scientists may be able to put visitors 
to work and make use of their skills. When 
Brigham-Grette first offered him a place on 
her trip, Martin was “ready to mop floors”. 
But the team found more useful things for 
him to do: he had once worked building 
houses in needy communities, and had con- 
struction skills. On one occasion, Brigham- 
Grette assigned Martin to work on a drill. 
He didn't falter when the sub-freezing tem- 
peratures chilled the drill’s fluids and shut it 
down. Instead, he assumed the role of fore- 
man. “We brought it inside for warmth,” he 
recalls. “But the exhaust was poisonous, so 
I built an enclosure to channel the exhaust 
outside” 

Goldner happened to know how to drive 
boats, so Brigham-Grette asked him to ferry 
scientists and equipment around. And on 
a remote field site in Morocco, Olsen once 
asked a television crew with a large budget 
to transport some of his team members 
between sites. When it comes to field-trip 
guests, he says, “their needs and your needs 
can overlap,’ = 


Lucas Laursen is a freelance journalist 
based in Madrid, Spain. 
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JOB SECURITY 

British prospects grim 
Recruitment and job security for early- 
career faculty members in the United 
Kingdom will become more uncertain as 
government funding for higher education 
declines, says a trade union. The University 
and College Union (UCU) in London 
released a study of higher-education 
revenues on 5 January. The report says 

that the proportion of university income 
accounted for by government funding, 
estimated to be 31.6% for 2010-11, will 

fall to 15% by 2014-15. That will probably 
lead to a reduction in long-term, renewable 
contracts for junior faculty members, says 
Stephen Court, the UCU’s senior research 
officer. “The shifting pattern of income 
from the government will make university 
employment more precarious,’ he says. 


BIOMEDICAL SCIENCE 


Jackson Lab to expand 


The Jackson Laboratory has completed 
plans to opena branch in Connecticut. On 
5 January, state governor Dannel Malloy 
finalized an agreement with Jackson, a 
biomedical research centre based in Bar 
Harbor, Maine. The lab will initially lease 
space from the University of Connecticut 
Health Center in Farmington. Next year, 
construction will begin on a permanent 
facility in Farmington that is set to employ 
300 scientists and support staff by 2017. 
Recruitment is already under way in 
computational biology and systems 
genomics for the temporary lab, which will 
have 27 employees, including scientists, 
by the end of 2012, says Michael Hyde, 
Jackson's vice-president for advancement. 


UNITED STATES 
Scientists miss their peak 


US biomedical scientists rarely earn their 
first major grants during their optimum 
innovative years, concludes a study 

(K. R. W. Matthews et al. PLoS ONE 6, 
€29738; 2011). In 2008, the average age 

of a scientist getting a first grant from the 
US National Institutes of Health (NIH) 
was 42, the authors found. But researchers 
who won Nobel prizes in medicine or 
chemistry between 1980 and 2010 did 
their pioneering work at an average of 

41 years; 78% did so before 51, the average 
age of NIH investigators now. Part of the 
problem is that the NIH is risk-averse and 
unwilling to fund nascent work, argues 
Kirstin Matthews, lead author of the study 
and a science and technology policy fellow 
at Rice University in Houston, Texas. 
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distractions once the trip is under way — in 
the field, it is sometimes hard for scientists to 
explain the big picture of the research effort 
because they are so caught up in the mechan- 
ical details of data collection. “When teachers 
are going out in the field, they're seeing only 
one piece of the process,’ says Warburton. 
“The team is usually very stressed collecting 
data and may not have the energy to explain 
the big picture in the moment.” 

For teacher trips, pre-planning should 
entail some thought about how the experi- 
ence will benefit the classroom. Before his 
own expedition to Siberia with Brigham- 
Grette, Tim Martin, a teacher at Greensboro 
Day School in North Carolina, discussed 
with her how his web design and photogra- 
phy skills could help to translate her science 
into useful lessons for his students. He also 
improved his understanding of her work. 

Since their expedition together, Brigham- 
Grette and Goldner have stayed in touch so 
that she can share her group’s analyses of the 
data that they collected. “The whole goal of 
PolarTREC shouldn't be about me, or my trip 
to the Arctic, it’s really about my students 
and the outreach that I can do,” says Gold- 
ner. “That ongoing collaboration is really 
important.” 


ON THE RECORD 
Allowing guests, especially journalists, to 
participate in expeditions may not be the 
best choice for every expedition. Anything 
from precarious logistics to bad weather or 
sensitive politics might cause scientists to 
postpone, says Olsen. Krajick adds, “If you're 
not prepared to go on the record all the time, 
I don't think you should take someone along.” 
Most fieldwork takes place in too intimate a 
setting to expect much control over gossip or 
frayed nerves. 

Sharing the experience doesn't have to 


mean relinquishing all control. Olsen has 
experimented with one way of dealing with 
the time demanded by television crews: he has 
decided to not bother trying to conduct real 
science for the cameras. Mock observations 
are enough for television journalists, and not 
having to worry about botching precise meas- 
urements makes answering questions easier. 

Indeed, hosting often requires flexibility. 
“You cant plan which days are for discovery 
and which are photo days,” says McGee. He 
acknowledges that Fox’s visit was rather long 
at four days, but McGee liked the result — a 
nuanced depiction of his work. 

Field guests can be an asset, even beyond 
outreach: scientists may be able to put visitors 
to work and make use of their skills. When 
Brigham-Grette first offered him a place on 
her trip, Martin was “ready to mop floors”. 
But the team found more useful things for 
him to do: he had once worked building 
houses in needy communities, and had con- 
struction skills. On one occasion, Brigham- 
Grette assigned Martin to work on a drill. 
He didn't falter when the sub-freezing tem- 
peratures chilled the drill’s fluids and shut it 
down. Instead, he assumed the role of fore- 
man. “We brought it inside for warmth,” he 
recalls. “But the exhaust was poisonous, so 
I built an enclosure to channel the exhaust 
outside” 

Goldner happened to know how to drive 
boats, so Brigham-Grette asked him to ferry 
scientists and equipment around. And on 
a remote field site in Morocco, Olsen once 
asked a television crew with a large budget 
to transport some of his team members 
between sites. When it comes to field-trip 
guests, he says, “their needs and your needs 
can overlap,’ = 


Lucas Laursen is a freelance journalist 
based in Madrid, Spain. 
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JOB SECURITY 

British prospects grim 
Recruitment and job security for early- 
career faculty members in the United 
Kingdom will become more uncertain as 
government funding for higher education 
declines, says a trade union. The University 
and College Union (UCU) in London 
released a study of higher-education 
revenues on 5 January. The report says 

that the proportion of university income 
accounted for by government funding, 
estimated to be 31.6% for 2010-11, will 

fall to 15% by 2014-15. That will probably 
lead to a reduction in long-term, renewable 
contracts for junior faculty members, says 
Stephen Court, the UCU’s senior research 
officer. “The shifting pattern of income 
from the government will make university 
employment more precarious,’ he says. 


BIOMEDICAL SCIENCE 


Jackson Lab to expand 


The Jackson Laboratory has completed 
plans to opena branch in Connecticut. On 
5 January, state governor Dannel Malloy 
finalized an agreement with Jackson, a 
biomedical research centre based in Bar 
Harbor, Maine. The lab will initially lease 
space from the University of Connecticut 
Health Center in Farmington. Next year, 
construction will begin on a permanent 
facility in Farmington that is set to employ 
300 scientists and support staff by 2017. 
Recruitment is already under way in 
computational biology and systems 
genomics for the temporary lab, which will 
have 27 employees, including scientists, 
by the end of 2012, says Michael Hyde, 
Jackson's vice-president for advancement. 


UNITED STATES 
Scientists miss their peak 


US biomedical scientists rarely earn their 
first major grants during their optimum 
innovative years, concludes a study 

(K. R. W. Matthews et al. PLoS ONE 6, 
€29738; 2011). In 2008, the average age 

of a scientist getting a first grant from the 
US National Institutes of Health (NIH) 
was 42, the authors found. But researchers 
who won Nobel prizes in medicine or 
chemistry between 1980 and 2010 did 
their pioneering work at an average of 

41 years; 78% did so before 51, the average 
age of NIH investigators now. Part of the 
problem is that the NIH is risk-averse and 
unwilling to fund nascent work, argues 
Kirstin Matthews, lead author of the study 
and a science and technology policy fellow 
at Rice University in Houston, Texas. 
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A GAME OF SELF-DECEIT 


BY CLAYTON LOCKE 


awning, collapsing and shaking his 

umbrella, and step into the lazy smoke 
of the izakaya. When he raised his gaze to 
sweep the diners and drinkers at their tables, 
it took me a moment to work out what was 
wrong. Then I gotit: I was looking at myself, 
notas I usually did, left-right inverted in the 
mirror, but as I saw myself in photographs. 
It’s funny that my brain should pick out such 
a subtle detail. He saw me sitting 
at one of the outer tables, next 
to the plastic sheeting turned 
opaque with hammering rain. 
Then, as he moved away from 
the door, another figure entered. 
He was with my wife. 

His greeting bow was the 
slightest nod. “Konbanwa? 

“Why did you bring her, you 
fool?” Without thinking I had 
addressed him using temee, an 
aggressive personal pronoun. I 
didn't even dare look at my wife, 
for fear of choking emotion. 

“Because, Koji-kun, I want 
everything we do here to be wit- 
nessed. I’m not underestimating 
you.” 

I composed my face into the 
ever-necessary mask of neutral- 
ity. He had taken me off-guard, 
and I silently reprimanded 
myself. I harboured a sense of 
superiority over him, although I 
knew it to be illogical. This man 
was exactly me. Well, almost. I 
had the memories of the mountains and the 
time of cold, the experience that has shifted 
my entire way of thinking. A way of thinking 
I intended to preserve. 

“This is illegal. You have to stay more than 
20 kilometres away from me,” he said. 

I couldn't help but laugh at the notion that 
a bystander would pick us as clones; a salary- 
man ina crisply tailored suit, and a man in 
ill-fitting rags. He was right though, yet still 
he came. That's good, it meant he was afraid. 
I flicked a glance at my wife. For a heartbeat 
our eyes met and the blood sang in my ears. 

“[ve got a proposal for you,’ I said. I nod- 
ded to the empty chairs at my table and after 
a moment of hesitation they sat. I reached 
down for the plastic bag beside my chair and 
placed it on the table between us. Inside were 
documents and photos. 


LE him duck under the cover of the 


The resurrection shuffle 


“So it’s blackmail? I credited myself with 
more imagination than this. Where did you 
get these?” He paused a moment: when he 
spoke again his voice was level. “What’s your 
problem‘ It’s legal, you've been given a new 
identity, you're still you. You're wallowing in 
this self-serving misery, don't look to me to 
pull you out of it!” 

“You don’t know the half of it,’ I said. I 
reflected on all that had brought me here, 
how I should never have gone mountain- 
eering without a backup hoverbelt. It was 


only when the primary one didn’t catch and 
Islipped and fell that Id realized how stupid 
Id been. I'd come to a stop in a deep ravine, 
my knees shattered, coughing blood. A blow 
to my head had cracked the personal smart- 
phone chip embedded in my skull. Suddenly 
I was cut off from the world, no Internet 
access, no emergency calls, and a silence 
like I'd not known in many years. I'd been 
missing for less than two days, but who could 
survive those subzero temperatures? Lost in 
the mist and rain, hearing the electric hiss 
of hover-copters as they glided overhead, 
and shouting myself hoarse. Id finally been 

found, but by the time 


DNATURECOM _ the gears of bureaucracy 
Follow Futures on had turned, nearly two 
Facebook at: weeks had passed, and 
go.nature.com/mtoodm — to the world I was dead. 
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How easily theyd discarded me. Life was 
cheap now, but death was cheaper. Already 
a clone body had been grown and my lat- 
est memory backup imprinted on the fresh 
brain. It wasn't possible to re-enter my life, 
for my new self had all the rights and iden- 
tity: everything from bank accounts and job 
qualifications to friends and my wife. 

I had changed in those weeks. Although 
I cursed myself for my stupidity, I held the 
experience in the mountains as the most 
important thing that had ever happened to 
me. I wanted to keep it and shape 
my life with it as part of me. I'd 
spent long nights seething with 
jealousy at the imposter taking 
over my life. Then I had a plan, 
and the very next day I spent 
the last yen of my pension ona 
full up-to-the-minute memory 
backup at a professional storage 
agency. 

I blinked back into the pre- 
sent. The new me snatched 
across the table and tried to grab 
the plastic bag but I caught his 
wrist and in a moment we were 
both on our feet and struggling. 
In the confusion I pulled the 
gun from my rear pocket. This 
is where my preparations paid 
off, my risky burglary of what 
had once been my own home to 
retrieve this very weapon. 

The gun was between us now 
(it was an evenly matched fight, 
I observed wryly) and I senseda 
crowd gathering. I twisted back- 
wards, keeping my hold on him, 
and we fell to the ground, and the gun was 
pointed at my own temple, my finger around 
the trigger. I didn’t think I'd have the cour- 
age, but now in the moment it was all too 
easy. 

“Murderers dont come back; I said. I saw 
his eyes widen as he recognized his pistol, the 
realization hed been out-played. He knew 
what I knew, the death penalty was sure 
and a criminal would not be resurrected. It 
would be the real me, who had lived through 
the mountain cold, who would live again. He 
tried to withdraw his hands from the pistol 
but it was too late. The muzzle tight against 
the side of my head, I pulled the trigger. m 


Clayton Locke is an Australian physicist 
currently researching frequency standards 
and metrology in Tokyo. 


JACEY 


Hail ees. 


doi:10.1038/nature10720 


Pathway complexity in supramolecular 


polymerization 


Peter A. Korevaar?, Subi J. George’, Albert J. Markvoort’”, Maarten M. J. Smulders!?, Peter A. J. Hilbers!?, 
Albert P. H. J. Schenning”, Tom F. A. De Greef!?? & E. W. Meijer? 


Self-assembly provides an attractive route to functional organic mate- 
rials, with properties and hence performance depending sensitively 
on the organization of the molecular building blocks'*. Molecular 
organization is a direct consequence of the pathways involved in the 
supramolecular assembly process, which is more amenable to detailed 
study when using one-dimensional systems. In the case of protein 
fibrils, formation and growth have been attributed to complex 
aggregation pathways* * that go beyond traditional concepts of homo- 
geneous” "' and secondary” “ nucleation events. The self-assembly of 
synthetic supramolecular polymers has also been studied and even 
modulated’**, but our quantitative understanding of the processes 
involved remains limited. Here we report time-resolved observations 
of the formation of supramolecular polymers from 1-conjugated oli- 
gomers. Our kinetic experiments show the presence of a kinetically 
favoured metastable assembly that forms quickly but then transforms 
into the thermodynamically favoured form. Quantitative insight into 
the kinetic experiments was obtained from kinetic model calculations, 
which revealed two parallel and competing pathways leading to 
assemblies with opposite helicity. These insights prompt us to use a 
chiral tartaric acid as an auxiliary to change the thermodynamic pref- 
erence of the assembly process’. We find that we can force aggrega- 
tion completely down the kinetically favoured pathway so that, on 
removal of the auxiliary, we obtain only metastable assemblies. 

The m-conjugated oligomer S-chiral oligo(p-phenylenevinylene) 
(SOPV, Fig 1a) serves as a functional material in a variety of organic 
electronic devices, with performance critically dependent on the 
material’s morphology”. This morphology is determined by the 
assembly mechanisms that transform the molecular dissolved 
monomers into the materials used in devices. The hydrogen-bonded 
dimers of SOPV self-assemble in apolar solvents via n-n interactions 
into one-dimensional helical stacks, and earlier circular dichroism (CD) 
and ultraviolet-visual spectroscopy studies under thermodynamic 
control revealed exclusive formation of left-handed M-type helical 
aggregates (M-SOPV) through a nucleated growth mechanism”’. 
In practice, however, the assembly process is often under kinetic 
control—prompting us to examine in detail the supramolecular 
polymerization process under non-equilibrium conditions. 

Upon rapid quenching of SOPV from the molecularly dissolved 
state to 273 K, we observed the formation of a mixture of M-SOPV 
and aggregates with opposite helicity, as evidenced by the opposite sign 
of the bisignated Cotton effect (Fig. 1b). At 298 K, these right-handed 
P-type aggregates (P-SOPV) slowly converted into thermodynamically 
stable M-SOPV aggregates (Supplementary Fig. 1). The observation of 
metastable P-SOPV aggregates indicated that the supramolecular 
polymerization of SOPV involves two different aggregation pathways, 
which we termed the on-pathway (leading to M-SOPV) and off- 
pathway (leading to P-SOPV) (Fig. Ic). 

To study the aggregation mechanism of SOPV and quantify the self- 
assembly pathways under kinetic control, we conducted stopped-flow 


experiments in which a concentrated solution of molecularly dissolved 
SOPV in chloroform was mixed with an excess of methylcyclohexane 
(MCH) to initiate self-assembly. The subsequent formation of helical 
SOPV aggregated in MCH was probed using CD spectroscopy 
(Supplementary Discussion 1). These kinetic experiments were con- 
ducted at 293 K and 308 K, with different concentrations of SOPV. At 
293K and the lowest SOPV concentrations, the rate of aggregate 
formation initially increased with time, characteristic of a lag phase 
(Fig. 2a, c). The time-dependent CD signal was always negative under 
these conditions, suggesting the direct formation of thermodynamically 
stable on-pathway M-SOPV aggregates. At higher concentrations 
(29 1M) a positive CD signal appeared in the initial stages of the 
assembly process and then developed into a negative CD signal at later 
times, suggesting the initial formation of off-pathway P-SOPV aggre- 
gates that then convert into thermodynamically stable M-SOPV aggre- 
gates. Remarkably, the time at which 50% of the aggregation process 
was completed (t— 50) is longer at 15M than at 10 uM (Fig. 2d). 
Analogous kinetic studies at 308 K also revealed the presence of a lag 
phase in experiments at the lowest concentrations (Fig. 2b, e) and an 
inverted dependence of t — 50 on concentration, although the shortest 
t— 50 time shifts to a higher SOPV concentration (20 UM; Fig. 2d). 
Detailed analyses unambiguously prove that both the on- and off- 
pathway are formed via a homogeneously nucleated growth mech- 
anism (Supplementary Figs 5 and 6). 

To rationalize the experimental aggregation kinetics, we extended 
models known in the field of protein fibrillization®”"’ by incorporating 
two competing, nucleated assembly pathways in which prenucleus 
oligomers (oligomer aggregates below the critical nucleation size) 
and helical aggregates change size through monomer association 
and dissociation (Fig. 3a, Supplementary Discussion 2). For the 
M-SOPV aggregates, non-helical prenucleus oligomers change size 
with rate constants for association and dissociation of a and b, respec- 
tively. Once the critical nucleus (with size n) is reached, the helical 
aggregates are in the elongation regime and change size through 
monomer association with the same rate constant a, while dissociation 
proceeds with rate constant c. The steady-state concentration of the 
aggregates in the nucleation phase is determined by the nucleation 
equilibrium constant K, = a/b, while in the elongation phase it is 
determined by the elongation equilibrium constant K, = a/c. The 
nucleated growth of off-pathway P-SOPV aggregates is described 
analogously, with nucleus size n*, rate constants a*, b* and c*, and 
equilibrium constants K,,* and K,*. An essential model assumption is 
that the transition from metastable to thermodynamically stable aggre- 
gates occurs via depolymerization of P-SOPV aggregates and sub- 
sequent growth of M-SOPV aggregates. This is justified by the high 
helix reversal penalty (8.1 kgT, where ky is the Boltzmann constant and 
T is temperature), obtained by ‘majority rules’ experiments, which 
rules out intrastack stereomutation as an alternative transition mech- 
anism (Supplementary Fig. 8)”. 
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Figure 1 | Pathway complexity in supramolecular polymerization of SOPV. 
a, Molecular structure of SOPV with S-chiral butoxy side chains. b, CD spectra 
of SOPV in MCH in disassembled state (343 K, black line), thermodynamically 
stable M-SOPV (273 K, blue line) and mixture of M-SOPV and metastable 
P-SOPV (273K, red line). c, Schematic representation of the aggregation 
pathways of SOPV, including the growth of two competing assemblies. Self- 
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assembly of SOPV in apolar solutions is initiated by formation of a quadruply 
hydrogen-bonded dimer. The hydrogen-bonded dimer further self-assembles 
into helical stacks via a nucleation—elongation growth mechanism. Right- 
handed P-helices form quickly but are less stable than the left-handed 
M-helices, which form more slowly. 
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Figure 2 | Experiments under kinetic control. Stopped-flow studies on self- 
assembly of SOPV. a, Concentration-dependent kinetics at 293 K. SOPV 
concentrations (black arrow shows increasing concentration) are colour-coded 
as follows: 5 uM (black), 6.1 1M (red), 6.5 UM (light green), 7.8 uM (dark blue), 
8.7 uM (light blue), 9 uM (pink), 10 uM (yellow), 15 uM (dark green) and 

19 uM (orange). b, Concentration-dependent kinetics at 308 K. SOPV 
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concentrations are 10 1M (black), 12.5 UM (red), 15 uM (green), 20 1M (blue) 
and 25 iM (orange). c, Zoom in for initial stages of growth process studied at 
293 K in panel a. d, Time at which 50% of final CD intensity is reached (t — 50) 
versus concentration at 293 K and 308 K. The lines serve to guide the eye. 
e, Zoom in for initial stages of growth process studied at 308 K in panel b. 
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Simulations with this model showed that off-pathway P-SOPV 
aggregates only appear in the initial stages of the self-assembly process 
if sufficient monomers are present and P-SOPV formation is kinetically 
favoured over that of M-SOPV. Specifically, the total monomer con- 
centration (C,o,) has to exceed the critical concentration of P-SOPV 
(Ctot > Ke*'), which depends on the elongation equilibrium constant 
of this species. Moreover, the initial formation of P-SOPV is kinetically 
favoured when the forward rate constant of nucleation and elongation 
is larger for the off-pathway aggregates (a* > a) or if the pre-nucleus 
oligomers of P-SOPV are kinetically more stable (K,* > K,, or a*/ 
b* > a/b) (Supplementary Fig. 9). As a result, off-pathway P-SOPV 
aggregates can only self-assemble at high concentrations and low tem- 
peratures (high K,*), as was observed experimentally. With a* > a, the 
kinetic model successfully described the experimental kinetic data, 
including the initial formation of P-SOPV (Fig. 3b). The calculated 
Gibbs free energy diagram revealed that the P-SOPV nucleus is thermo- 
dynamically less unstable than the M-SOPV nucleus (that is, 
K,,* > K,), whereas M-SOPV is more stable in the elongation phase 
(Fig. 3c). Kinetic studies on protein fibrillation have shown that for the 
initial stages of homogeneously nucleated aggregation involving a 
thermodynamically unstable nucleus, the conversion to the stable 
product is proportional to time squared, ¢” (ref. 9). In our experiments 
with the lowest SOPV concentrations, the changes of the CD signal in 
the lag phases indeed can be described as being proportional to f°, 
demonstrating homogeneous nucleation in the aggregation of SOPV 
(Supplementary Fig. 10). 

To corroborate the proposed involvement of two aggregation 
pathways, we further analysed the concentration dependence of the 
kinetic data and found that the changes in the t— 50 values with 
concentration can only be rationalized by taking off-pathway aggrega- 
tion into account (Fig. 3d, e): the relatively large amount of metastable 
P-type aggregates formed at higher concentrations sequester mono- 
mers and lower their solution concentration, thereby hampering the 
formation of the thermodynamically stable M-type aggregates. Our 
simulations did not show a further decrease of t—50 at higher 


LETTER 


concentrations with reasonable model parameters. Because our 
model only considers monomer addition and dissociation, oligomer 
reactions and fragmentation'‘*—which become more important at 
higher concentrations—may explain this difference between simulated 
and observed behaviour. Simulations exploring the effect of temper- 
ature showed that the maximum in the aggregation rate shifts to higher 
concentrations and the aggregation rate gets faster (that is, t — 50 
values get lower) on increasing temperature, as was observed experi- 
mentally (Supplementary Fig. 11). 

The discovery of these two different assembly pathways, producing 
structurally different aggregates, encouraged us to attempt to direct the 
assembly of SOPV towards metastable products. To force the assembly 
of SOPV into exclusively P-type aggregates instead of the mixture 
obtained upon fast cooling (see above), we used a two-step non- 
covalent synthetic method (Fig. 4a, b) inspired by previous assembly 
control experiments’’. The earlier work involved achiral oligo(p- 
phenylenevinylene) monomers that assemble into chiral helical archi- 
tectures of equal energy, and showed that the addition of chiral 
auxiliaries transforms an initially racemic mixture of these structures 
so that one of the two possible chiral helical architectures dominates”. 
Here, we used a chiral auxiliary to direct the assembly so that, upon 
removal of the auxiliary, we obtain the thermodynamically unstable 
P-type aggregates. 

In the first step, S-chiral dibenzoyl tartaric acid (DTA) was used to 
dictate its chirality to the SOPV monomer via two-fold hydrogen 
bonding, orthogonal to the SOPV dimerization (Supplementary 
Discussion 3). Aggregates having opposite helicity (P-DTA-SOPV) 
compared to equilibrium conditions (M-SOPV) are formed, as 
demonstrated by the opposite CD spectrum (Fig. 4c). In the second 
step, the complete removal of DTA from the SOPV aggregates by 
aqueous extraction at 273K using ethylene diamine resulted in the 
formation of transiently stable P-SOPV aggregates at this temperature 
(Supplementary Fig. 12). The exclusive formation of P-SOPV is 
demonstrated by the absolute ellipticity of P-SOPV, which equals 
the ellipticity of a solution containing exclusively M-SOPV aggregates 
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Figure 3 | Analysis aggregation pathway competition with kinetic model. 
a, Schematic representation of reversible kinetic model for nucleated 
supramolecular polymerization, including a metastable off-pathway and a 
thermodynamically stable on-pathway competing for free monomer. b, Kinetic 
trace of the self-assembly of SOPV and a simulated kinetic trace using the 
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pathway competition model. The inset to b shows a zoom in for the initial stages 
of the growth process. c, Calculated Gibbs free energy diagram for formation of 
P-SOPV and M-SOPV. d, Concentration-dependent simulation of kinetics with 
pathway competition model. e, t — 50 versus SOPV hydrogen-bonded dimer 
concentration obtained from simulated concentration-dependent kinetics. 
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Figure 4 | Controlled formation of exclusively metastable aggregates via a 
two-step non-covalent synthetic methodology. a, Molecular structure of 
S-chiral dibenzoy]l tartaric acid. b, Schematic representation of preparation of 
pure P-SOPV via a two-step non-covalent synthetic methodology. ¢, CD 


under equilibrium conditions. The kinetic lability of P-SOPV is 
demonstrated by annealing at 298K, resulting in a time-dependent 
stereomutation of the CD spectra indicative of a conversion from 
P-SOPV towards M-SOPV aggregates (Fig. 4d). The single isosbestic 
point in these time-dependent CD spectra and their apparent first-order 
kinetics (Supplementary Fig. 13) indeed indicated that two distinct states 
are involved in the conversion process: P-SOPV and M-SOPV. 

Our observations established that the aggregation of SOPV involves 
two competing pathways leading to assemblies with opposite helicity, 
one of which is favoured kinetically and the other thermodynamically. 
We note that kinetically controlled formation of one-dimensional 
supramolecular aggregates has been reported”*”*, but the metastable 
states involved have so far only been probed in terms of their static 
properties (such as morphology or photophysical properties). Our 
kinetic studies, by probing the full aggregation process starting from 
free monomers, reveal that the influence of the metastable state—the 
off-pathway aggregates—on the overall assembly process is mediated 
through the equilibrium with free monomers. This effect is a common 
mechanism for one-dimensional supramolecular systems. We also 
note that almost all organic materials used in functional devices are 
processed starting from solutions of the molecular dissolved compo- 
nents. This means that by influencing the subtleties of the self-assembly 
process generating these materials, through tuning of the on-pathway 
or off-pathway mechanisms, the resulting morphologies could poten- 
tially be controlled to arrive at optimized self-assembled functional 
materials. 


METHODS SUMMARY 


Thermodynamically stable M-SOPV (MCH, 100 1M) is obtained upon slow cool- 
ing from the molecularly dissolved state (60 Kh~') to 273K, and equilibrium 
conditions are verified by time-dependent measurements, showing no changes in 
time. Metastable P-SOPV (MCH, 100 1M) is obtained upon quenching molecularly 
dissolved SOPV to 273 K (ice bath). Stopped-flow experiments are performed by 
mixing a concentrated SOPV solution in chloroform with MCH (1:50 volume ratio, 
Berger ball mixer, mixing time <10ms). The subsequent formation of helical 
aggregates is probed by a CD spectrometer (466 nm) connected in-line with the 
stopped-flow instrument. The absence of (oligomeric) SOPV aggregates at the start 


4 | NATURE | VOL 000 | 00 MONTH 2012 


CD (mdeg) 


400 
Wavelength (nm) 


spectra of M-SOPV, P-DTA-SOPV and P-SOPV. d, CD spectra acquired 
during conversion from metastable P-SOPV towards M-SOPV upon annealing 
at 298 K. 


of the experiment is verified by 'H nuclear magnetic resonance (NMR) studies. The 
nucleated aggregation pathway competition model describes the kinetic data with 
parameters n=n*=5; a=2.9X 10*M's7}; K.= 1.52 X 10°M", K,/ 
K, = 0.0526, K,*/K, = 1.38, K.*/K. = 0.164 and a*/a = 3.79 (Matlab, ode15s sol- 
ver). The value of K, used is independently assessed by melting experiments. 
Alternatively, models using diffusion-controlled kinetics afford unrealistic values 
of K,/K.< 10*. The Gibbs free energy as a function of chain length i is calculated 
using AG? G— >i = —RTn(Ke,q), where R is the gas constant. To obtain P-DTA- 
SOPV, DTA and SOPV are mixed in chloroform (1:1). Subsequently chloroform is 
evaporated; the residual DTA-SOPV complex is dissolved in MCH (100 pM) on 
heating above the critical elongation temperature. Cooling from the molecularly 
dissolved state to 293 K results in P-DTA-SOPV aggregates. On adding an equal 
volume of aqueous ethyl diamine solution (0.15 M) at 273 K, vigorous mixing of the 
water and MCH phase (1 min) and full separation of both phases, the top MCH 
layer containing metastable P-SOPV is isolated. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Materials. The synthesis of SOPV has been described previously”’. All solvents 
were used as received. 

Instrumentation. CD, linear dichroism and corresponding ultraviolet-visual 
spectra were recorded using a Jasco J-815 CD spectrometer. Sensitivity, response 
time and scanning speed were chosen appropriately. The temperature was con- 
trolled using a Jasco Peltier temperature controller with a range of 263-383 K and 
adjustable temperature slope. Stopped-flow studies were performed using a 
Biologic SFM 400 stopped-flow setup with Berger Ball mixer, Biologic TC 100 cuvet 
(optical path length 1 cm), and Biologic MPS-60 controller unit. The stopped-flow 
cuvet was connected in-line with a Jasco J-815 CD spectrometer. To control the 
temperature of the cuvet and syringes, SFM 400 was connected to a Julabo F12 
temperature controller (ethylene glycol bath with thermostat). "H-NMR spectra 
were recorded at 298 K ona Varian Unita Inova (500 MHz), in CDCl; (Cambridge 
Isotope Laboratories). Fluorescence measurements were performed on an 
Edinburgh Instruments FS920 double-monochromator luminescence spectro- 
meter using a Peltier-cooled red-sensitive photomultiplier. Ultraviolet-visual 
spectra acquired on the complexation of o-cyclodextrin with p-nitrophenolate 
were recorded on a Perkin Elmer L40 photospectrometer. 

Experiments under kinetic control. Thermodynamically stable M-SOPV 
(100 1M) in methylcyclohexane (MCH, spectroscopic grade, Sigma-Aldrich) 
was obtained upon slow cooling (60 Kh’ ') from the molecularly dissolved state 
(343 K) to 273 K; equilibrium conditions were verified by time-dependent mea- 
surements, showing no further change in time. Metastable P-SOPV (MCH, 
100 1M) was obtained upon quenching molecularly dissolved SOPV from 343 K 
to 273 K (ice bath). The optical path length for the reported spectra was 1 mm. The 
fraction of P-SOPV and M-SOPV obtained upon quenching SOPV (MCH, 
13 1M) from different temperatures was evaluated from the CD spectra by simu- 
lation of a linear combination of the spectra corresponding to a solution consisting 
of pure P-SOPV (obtained via the two-step non-covalent synthetic methodology) 
anda solution containing M-SOPV exclusively. The fraction of free SOPV monomer 
present before quenching was derived from the CD spectra under equilibrium con- 
ditions assuming ~monomer = 1—Phelical aggregate. The optical path length of the 
reported spectra for these quenching experiments was 1 cm. 

Stopped-flow experiments. To probe the kinetics of the supramolecular 
polymerization of SOPV, SOPV in chloroform (spectroscopic grade, Sigma- 
Aldrich) was mixed with MCH (50:1; 1,508 11:30 pl, injection rate 8-9 mls}, 
mixing time <10ms), after which CD was followed in time (A= 466 nm; 
Ad = 1 nm; At= 1s; standard sensitivity; high tension voltage adjusted to get a 
direct-current voltage around 1 V). The SOPV concentration and the duration of 
the measurement were adjusted appropriately. The absence of (oligomeric) SOPV 
aggregates at the start of the experiment was verified by 'H-NMR studies. Kinetic 
curves were averaged over multiple stopped-flow experiments as follows. At 293 K: 
5 uM, 9; 6.1 uM, 2; 6.5 uM, 6; 7.8 uM, 1; 8.7 uM, 4; 9 WM, 2; 101M, 9; 15 WM, 3; 
19 1M, 6. At 308K: 10 uM, 6; 12.5 1M, 6; 15M, 11; 20§M, 9; 25M, 6. After 
the measurement, the concentration was verified using CD (466nm) and 


ultraviolet—visual (438 nm) spectra. To test the mixing in the stopped-flow setup, 
the complexation of o-cyclodextrin and p-nitrophenolate was measured. Stopped- 
flow experiments were performed upon 100:1 mixing of an o-cyclodextrin 
(Across) and a p-nitrophenolate (Merck) solution. Both solutions were buffered 
atpH = 11 (HPO,” /PO,° buffer). Settings: injection volumes (100:1) 2,011 ul:20 
ul, injection rate 7 mls‘, mixing time <10 ms, 4 = 410 nm, AZ = 1nm,/= 1cm, 
At = 1s, standard sensitivity, high tension voltage = 300 V; data averaged over 5 
injections. 

Aggregation pathway competition kinetics model. The kinetic model describes 
the competing nucleated aggregation pathways of both P-SOPV and M-SOPV via 
a sequence of differential equations. Each differential equation describes monomer 
association and dissociation to the respective aggregate with length i. Assuming 
that for i> N, with N>>n (N= 100), [X;+ 1] = «[X;j], the required number of 
differential equations was reduced to 2(N + 2). The resulting system of differential 
equations is solved using the odel5s solver in Matlab. The aggregation pathway 
competition model describes the kinetic data with parameters n= n* =5; 
a=2.9X 10°M's"'; K, = 1.52 X 10°M|, K,/K. = 0.0526, K,*/K, = 1.38, K.*/ 
K. = 0.164 and a*/a = 3.79. The value of K. used was independently assessed by 
melting experiments. Alternatively, models using diffusion-controlled kinetics 
afforded unrealistic values of K,/K.<10*. The Gibbs free energy diagram was 
calculated via AQ G1): = —RTIn(Kc), with gas constant R, temperature T, 
equilibrium constant K = K,, for i= nand K = K, fori > n and assuming the total 
SOPV hydrogen-bonded dimer concentration Co as the reference state, whereas 
the standard state was defined” at 1 M. The role of temperature on the aggregation 
pathway competition was rationalized by performing kinetic simulations at dif- 
ferent temperatures. The temperature-dependency of the forward rate constant 
was described by the well-known Arrhenius equation: a = Aexp(—E,,/RT), where 
Ais the kinetic prefactor and E,,; the activation energy. The temperature-dependency 
of K, was described using the Van’t Hoff equation: K, = exp(—(AH,° - TAS.°)/RT), 
where AH.” and AS,” represent the standard enthalpy and entropy of elongation, 
respectively. The other rate constants are defined via temperature-independent 
ratios. 

Two-step non-covalent synthetic methodology. To obtain P-DTA-SOPV, the 
DTA (provided by Syncom) and SOPV were mixed in chloroform (1:1). 
Subsequently, chloroform was evaporated by purging with nitrogen gas; the residual 
DTA-SOPV complex (red colour) was dissolved in MCH (100 11M) upon sonication 
and heating until the sample turned green, indicating molecular dissolution. Cooling 
from this molecularly dissolved state to 293 K resulted in P-DTA-SOPV aggregates. 
On adding an equal volume of aqueous ethyl diamine solution (0.15 M) at 273 K, 
vigorous mixing of the water and MCH phase (1 min) and full separation of both 
phases, the top MCH layer containing metastable P-SOPV could be isolated. 
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Multi-isotope imaging mass spectrometry quantifies 
stem cell division and metabolism 


Matthew L. Steinhauser!?, Andrew P. Bailey’, Samuel E. Senyo'’, Christelle Guillermier?*°, Todd S. Perlstein’?, Alex P. Gould’, 


Richard T. Lee'*® & Claude P. Lechene**° 


Mass spectrometry with stable isotope labels has been seminal in 
discovering the dynamic state of living matter’, but is limited to 
bulk tissues or cells. We developed multi-isotope imaging mass 
spectrometry (MIMS) that allowed us to view and measure stable 
isotope incorporation with submicrometre resolution®*. Here we 
apply MIMS to diverse organisms, including Drosophila, mice and 
humans. We test the ‘immortal strand hypothesis’, which predicts 
that during asymmetric stem cell division chromosomes contain- 
ing older template DNA are segregated to the daughter destined to 
remain a stem cell, thus insuring lifetime genetic stability. After 
labelling mice with '°N-thymidine from gestation until post-natal 
week 8, we find no '°N label retention by dividing small intestinal 
crypt cells after a four-week chase. In adult mice administered !°N- 
thymidine pulse-chase, we find that proliferating crypt cells dilute 
the '°N label, consistent with random strand segregation. We 
demonstrate the broad utility of MIMS with proof-of-principle 
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studies of lipid turnover in Drosophila and translation to the 
human haematopoietic system. These studies show that MIMS 
provides high-resolution quantification of stable isotope labels 
that cannot be obtained using other techniques and that is broadly 
applicable to biological and medical research. 

MIMS combines ion microscopy with secondary ion mass spectro- 
metry (SIMS), stable isotope reporters and intensive computation 
(Supplementary Fig. 1). MIMS allows imaging and measuring of stable 
isotope labels in cell domains smaller than one micrometre cubed. We 
tested the potential of MIMS to track DNA labelling quantitatively with 
©N-thymidine in vitro. In proliferating fibroblasts, we detected label 
incorporation within the nucleus by an increase in the °N:'4N ratio 
above natural ratio (Fig. 1a). The labelling pattern resembled chromatin 
with either stable isotope-tagged thymidine or thymidine analogues 
(Fig. 1b). We measured dose-dependent incorporation of '"N-thymidine 
over three orders of magnitude (Fig. 1c and Supplementary Fig. 2). We 
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Figure 1 | MIMS quantification of stable isotope-labelled thymidine 
incorporation by dividing cells. a, Dividing fibroblast labelled with '°N- 
thymidine. The cell surface was sputtered to reach the nuclei. Left, differential 
interference contrast reflection microscopy. Middle, '*N image revealing 
subcellular details including the nucleus (white arrows) with nucleoli. Right, 
hue saturation intensity (HSI) image mapping the '°N:"4N ratio. The rainbow 
scale ranges from blue, set to natural ratio (0.37%, expressed as 0% above 
natural ratio), to red, where the ratio is several fold above natural ratio (700%, 
eight times the natural ratio). °N labelling is concentrated in the nucleus. Scale 
bars, 10 jum. b, Fibroblast nuclei after serial or parallel DNA labelling with ISN- 
thymidine, C-thymidine, BrdU (*'Br) or iododeoxyuridine (?71). Parallel 
labelling (top row): colocalization of label confirmed by the merged image (far 


right). Sequential labelling (bottom row): non-superimposable nuclear 
labelling. Scale bars, 5 jum. ¢, Concentration-dependent nuclear N-labelling in 
'SN-thymidine treated fibroblasts. Pixel-by-pixel quantification of !°N- 
incorporation. Data are derived from the mean of intranuclear pixels. 
Sigmoidal dose response curve: R* = 0.99. d, Nuclei from '°N-thymidine- 
labelled human foreskin fibroblasts after 24-h chase (cycle approximately 18 h). 
Scale bars, 5 um. e, One group labelled similarly to control cells (undivided), the 
other with labelling that was approximately half that of control (divided). 

f, '°N:'“N HSI image of the small intestine. '"N-thymidine labelling (one week) 
of nuclei extends from the crypts to the tips of the villi. Mosaic: 8 tiles, 80 pm 
each. Scale bar, 30 1m. g, Sigmoidal dose response curve: 'SN-thymidine 
labelling after single subcutaneous injection (R* = 0.99). 
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also tracked fibroblast division after a 24-h label-free chase (Fig. 1d, e 
and Supplementary Fig. 3). Cells segregated into two populations, one 
indistinguishable from control cells suggesting no division, the other 
with halving of label, consistent with one division during the chase. 

We found similar results by tracking cell division in vivo in the small 
intestine (Fig. 1f, g and Supplementary Figs 4-6). We measured dose- 
dependent '°N-thymidine incorporation within nuclei of actively 
dividing crypt cells (Fig. 1g and Supplementary Fig. 4), down to a dose 
of 0.1 ug per g (Supplementary Fig. 2). The cytoplasm was slightly 
above natural ratio, probably due to low-level soluble '"N-thymidine 
or mitochondrial incorporation (Supplementary Fig. 2). We measured 
halving of label with each division during label-free chase (Supplemen- 
tary Fig. 6). 

We then tested the ‘immortal strand hypothesis’, a concept that 
emerged from autoradiographic studies’ and that predicted long-term 
label retaining cells in the small intestinal crypt*’. It proposes that 
asymmetrically dividing stem cells also asymmetrically segregate 
DNA, such that older template strands are retained by daughter cells 
that will remain stem cells and newer strands are passed to daughters 
committed to differentiation (Supplementary Fig. 7)*°. Modern studies 
continue to argue both for*” or against’*-’* the hypothesis, leading to 
the suggestion that definitive resolution of the debate will require a new 
experimental approach”. 

Although prior evidence suggests a concentration of label-retaining 
cells in the +4 anatomic position”*, we searched for DNA label reten- 
tion irrespective of anatomic position or molecular identity. We 
labelled mice with °N-thymidine for the first 8 weeks of life when 
intestinal stem cells are proposed to form’. After a 4-week chase, mice 
received bromodeoxyuridine (BrdU) for 24h before euthanasia to 
identify proliferating cells (Fig. 2a and Supplementary Fig. 8, Experi- 
ment 1), specifically crypt base columnar (CBC) cells and transit amp- 
lifying cells (Supplementary Fig. 9), which cycle at a rate of one and two 
times per 24h, respectively'*® (Supplementary Fig. 10). All crypt cell 
nuclei were highly labelled upon completion of '"N-thymidine; after a 
4-week chase, however, we found no label retention by non-Paneth 
crypt cells (Fig. 2b-f; n = 3 mice, 136 crypts analysed). *°N-labelling in 
N*/BrdU- Paneth and mesenchymal cells was equivalent to that 
measured at pulse completion (Fig. 2b, c) indicating quiescence during 
the chase (values above '°N:'4N natural ratio: Paneth pulse = 107.8 + 
5.0% s.e.m. m= 51 versus Paneth pulse-chase = 96.3 + 2.8% s.e.m. 
n= 218; mesenchymal pulse = 92.0+5.0% s.em. n= 89 versus 
mesenchymal pulse-chase = 90.5 + 2.2% s.e.m. n = 543). The number 
of randomly selected crypt sections was sufficient to detect a frequency 
as low as one label-retaining stem cell per crypt irrespective of ana- 
tomic location within the crypt. Because each anatomic level contains 
approximately 16 circumferentially arrayed cells*, a two-dimensional 
analysis captures approximately one eighth of the cells at each ana- 
tomic position (one on each side of the crypt; Supplementary Fig. 9a). 
Therefore, assuming only one label-retaining stem cell per crypt we 
should have found 17 label-retaining cells in the 136 sampled crypts 
(one eighth of 136) but we found 0 (binomial test P< 0.0001). The 
significance of this result held after lowering the expected frequency of 
label-retaining cells by 25% to account for the development of new 
crypts, a process thought to continue into adulthood”. In three addi- 
tional experiments, using shorter labelling periods and including in 
utero development, we also found no label-retaining cells in the crypt 
other than Paneth cells (Supplementary Fig. 8, Experiments 2-4). 

To address the possibility that long-term thymidine exposure or 
frequent high-dose injections introduced pitfalls, we limited labelling 
to the previously reported peak time period of label-retaining stem cell 
formation® and reduced the dose by 50-fold compared to prior experi- 
ments (Supplementary Fig. 8, Experiment 5). Approximately six label- 
retaining cells per intestinal circumference were observed previously® 
with a similar protocol; with about 90 crypts per circumference, this 
translates to 1 label-retaining cell per 15 sectioned crypts. We analysed 
330 crypts and observed 19 '"N*/BrdU~ Paneth cells (one per 17 crypt 
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Figure 2 | No label-retaining stem cells in the small intestinal crypt. a, °N- 
thymidine administered from post-natal day 4 to week 8. After 4-week chase, 
BrdU was administered (500 jig intraperitoneally every 6h) for 24h before 
euthanasia (see Supplementary Fig. 8). b, ‘“N: crypt structure and intense signal 
in intracytoplasmic Paneth granules at the crypt base. *'P: intense intranuclear 
signal. **S: intense signal within cytoplasmic Paneth cell granules. *'Br: direct 
measure of BrdU incorporation. '°N:''N: HSI image of '"N-thymidine labelling 
within °N*/BrdU~ Paneth cells (large arrow) and mesenchymal cells (small 
arrows). No other cells reveal °N retention. Large arrow head: recently formed 
(BrdU*/'°N7 ) Paneth cell. Small hatched arrow (middle left side of the crypt): 
unlabelled Paneth cell, (""N~/BrdU_ ). Scale bar, 15 pm. c, Continued analysis 
of the same crypt after narrowing the acquisition field. High '°N signal in a 
BrdU Paneth cell (large arrow) and mesenchymal cells closely associated with 
the crypt (small arrows). BrdU* CBC (hatched arrow) and Paneth cell (arrow 
head) are '°N_. Scale bar, 5 um. d, SN :4N HSI image of small intestinal crypt 
at the end of '°N-thymidine pulse. All nuclei are labelled. Nuclei with lesser 
degrees of labelling likely represent cells born during a period of lower 
circulating '"N-thymidine as expected given the different labelling protocols 
(Supplementary Fig. 8). Scale bar, 20 jum. e, Unlabelled mouse image. The entire 
crypt contains the natural abundance of SN. Scale bar = 10 jum. f, Mean % 
1SN* cells at the completion of pulse and after pulse-chase ( + standard error of 
CBC, transit amplifying (TA), Paneth and mesenchymal (Mes) cells (n = 3 
mice per group). 


sections), but no non-Paneth label-retaining cells (expected, 21; 
observed, 0; binomial test P<0.0001) or doubly labelled (°N*/ 
BrdU") cells irrespective of cellular identity. Thus, after labelling dur- 
ing all potential periods of stem cell formation, spanning in utero 
through post-natal development, we found no label-retaining stem 
cells in the small intestinal crypt other than Paneth cells, which are 
largely quiescent. We conclude therefore that the phenomenon of 
label-retaining cells as described by others was most likely due either 
to reagent toxicity or to insufficient resolution with the consequent 
misidentification of labelled post-mitotic Paneth cell nuclei as label 
retaining +4 stem cells. MIMS minimizes these potential artefacts due 
to its ability to detect non-radioactive isotopes and the high imaging 
resolution both laterally and vertically. Although our data cannot 
directly exclude the migration of label-retaining stem cells into the 
crypt under non-homeostatic conditions such as radiation injury", 
prior examinations of crypt clones argue against contributions from 
either a resting stem cell population or from stem cells originating 
outside the crypt”. 

We also studied label release in adult mice after stem cell formation. 
These experiments are particularly relevant given the recent suggestion 
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that crypt stem cells exhibit monoclonal drift over 8 months*’”, a 


model that is incompatible with lifetime survival of label-retaining 
stem cells but that does not exclude the possibility of biased template 
strand segregation. After administering '"N-thymidine for 2 weeks to 
label all proliferating crypt cells, '*"N-thymidine was stopped and BrdU 
was administered for 24h to label cells dividing during the '"N- 
thymidine-free chase (Fig. 3a). A consequence of the immortal strand 
hypothesis would be that a stem cell would produce a daughter in 
which '°N label was completely lost with the first division during chase 
(Supplementary Fig. 7). We observed no such BrdU" /'°N” crypt cells, 
even after extending the 'N-thymidine-free chase to 48 h (Fig. 3b and 
Supplementary Fig. 11). Proliferating (BrdU*) crypt cells (n = 3mice; 
563 cells analysed) diluted 'N-thymidine label (mean 71% above 
natural ratio) compared to undivided (BrdU ) cells (129% above natural 
ratio) or cells analysed at the end of '"N-thymidine pulse (128% above 
natural ratio), as expected of cells undergoing random chromosomal 
segregation (Fig. 3c). We also analysed crypt cell nuclei in late mitosis, 
finding '°N label and BrdU in both sets of segregating chromosomes 
(Fig. 3d and Supplementary Fig. 12, n = 232). Together, these data result 
from an analysis of 625 crypts, in which we did not find a single BrdU* 
nucleus or chromosomal complement that was not '°N-labelled, indi- 
cating that dividing cells in the crypt uniformly dilute labelled DNA ina 
pattern consistent with random DNA strand segregation. 

The broad applicability of MIMS is demonstrated by proof-of- 
principle studies in Drosophila and humans. Drosophila provides a 
model system in which to study the genetics of lipid metabolism, in vivo. 
Due to its small size, however, it is difficult to obtain sufficient material 
from specific tissue/cell types for measurement with conventional mass 
spectrometry. A previous study showed numerous lipid droplets in the 
enterocytes of a segment of the Drosophila larval intestine, the anterior 
midgut”. We therefore used MIMS to measure the incorporation and 
turnover of diet-derived '*C-palmitate within these cells. After larval 
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18N:14N (% above natural ratio) 


——< 
BrdU- BrdU+ 


Pulse Chase 


Figure 3 | Label-dilution in adult mice indicates random segregation of 
DNA strands. a, ‘SN-thymidine administered for 2 weeks to adult mice 
(osmotic mini-pump, 20 [1g h_}), then BrdU for 24h before euthanasia (See 
Supplementary Fig. 8). b, Dividing cells (BrdU*) dilute °N-thymidine label 
(small arrows) relative to undivided cells (BrdU ) (large arrows). Note two 
CBC cells with elongated nuclei at the crypt base. Scale bar, 10 jum. ¢, Divided 
crypt cells (BrdU), residing in CBC or +4-10 positions, demonstrated PN- 
dilution consistent with one or two rounds of division during the chase (n = 3 
mice per group). d, Mitotic crypt cell. Segregating chromosomes are visible in 
“4N and *'P images. '°N-label and BrdU were measured in both segregating 
chromosomal complements consistent with symmetric chromosomal 
segregation. Scale bar, 2 jim. 
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exposure to dietary '*C-palmitate, label is incorporated into enterocyte 
lipid droplets with a subcellular pattern similar to that seen with a 
neutral lipid dye (compare Fig. 4a with supplementary Fig. 13). 
Dietary '*C-palmitate pulses of varying periods indicate that the rate 
of °C label incorporation into lipid droplets is greater in anterior 
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Figure 4 | Extending MIMS to Drosophila and human metabolism. 

a, Drosophila anterior midgut enterocytes (gut, left panels) accumulate more 
‘SC from 24-h exposure to dietary '*C-palmitate than adipocytes (fat body cells 
(fb), right panels). Lipid droplets (arrows) have low nitrogen content and so 
appear dark in '‘N images (top panels). The '°C:'C HSI images reveal °C 
incorporation into lipid droplets. Scale bars, 10 1m. b, Time course of °C 
accumulation in Drosophila lipid droplets of anterior midgut enterocytes (Gut) 
and adipocytes (Fat body) over 24h of larval exposure to dietary '*C-palmitate. 
Initial rates of '°C-incorporation in the lipid droplets were 9.7% h_* (Gut) and 
1.16% h_' (Fat body) with the first time point after pulse taken at 1h. c, Rapid 
lipid turnover in Drosophila anterior midgut enterocytes (gut) after larval 
dietary '*C-palmitate pulse-chase. Upper panels show 'N images and lower 
panels show corresponding '*C:'*C HSI images, indicating strong '°C 
incorporation at the end of the pulse (bottom left) and rapid elimination during 
a 4-h chase (bottom right). Scale bar = 5 pum. d, 13C tracer dilution in the lipid 
droplets of anterior midgut enterocytes during a label-free chase fits an 
exponential decay curve (R? = 0.90, half-life = 9.1 h). e, °N-thymidine 
administered for 48 h: no labelled white blood cell were found. After a 4-week 
chase we found few labelled lymphocytes. Top: unlabelled polymorphonuclear 
leukocyte (PMN) with multi-lobed nucleus. Bottom: '°N-labelled lymphocyte 
after chase. The round nucleus seen in the *'P image occupies the majority of 
the cell, as is typical for lymphocytes. Scale bars, 3 um. 
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midgut enterocytes than in the adipocytes of the fat body, the major 
lipid storage depot in Drosophila (Fig. 4a, b). Pulse-chase experiments 
with dietary '*C-palmitate also demonstrate a reproducible exponen- 
tial decay (R* = 0.9) of the °C signal within enterocyte lipid droplets 
and provide an estimate for the half-life of approximately 9 h (Fig. 4c, 
d). These results demonstrate the utility of MIMS for precise measure- 
ment of in vivo lipid turnover at high resolution within individual 
Drosophila lipid droplets. 

In the first MIMS human experiment, we administered Sy). 
thymidine to a healthy volunteer by intravenous infusion for 48h. 
We performed MIMS analysis on peripheral white blood cell smears. 
We found no labelled white blood cells at the end of the infusion (3,600 
analysed). After a 4-week chase we found a few labelled white blood 
cells (Fig. 4c, 4 of approximately 2,000 cells analysed) consistent with a 
lag time before release from the bone marrow (chi-squared, P < 0.01). 
Radiolabelled thymidine has been administered to humans in numerous 
studies previously”, but these studies were conducted primarily on 
cancer patients. There is no known risk to using stable isotopes”. 
Thus, this experiment now opens the door to studies of metabolism 
and cell tracking in humans. 

In conclusion, we provide direct evidence under physiological con- 
ditions against non-random template strand segregation in the small 
intestine. Our approach encompasses all potential stem cell populations 
in the crypt, those identifiable by known markers**”* and putative ones 
not identified by a specific marker. We have demonstrated the power of 
MIMS to image and quantify cell metabolism in three different con- 
texts: mammalian intestinal cell division, Drosophila lipid metabolism 
and human lymphopoiesis. We anticipate that the measuring and 
imaging power of MIMS will make it a major tool to study metabolism 
and cell fate in animals and humans. 


METHODS SUMMARY 


Data were acquired with the prototype (four detectors) and a second-generation 
instrument (NanoSIMS 50L, Cameca: seven detectors). Quantitative mass images 
were analysed with Open MIMS software, a plug-in to Image] (http://www.nrims. 
harvard.edu/software.php). Stable isotopes were obtained from Cambridge 
Isotopes (°N-thymidine, 3C-thymidine) and Sigma-Aldrich (?C-palmitate). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Multi-isotope imaging mass spectrometry (MIMS). MIMS is based on secondary 
ion mass spectrometry (SIMS). An ionic beam sputters the surface atomic layer ofa 
sample, resulting in the ionization ofa small atomic fraction. Ina SIMS instrument, 
secondary ions are separated by mass, and then used to derive a quantitative atomic 
mass image of the surface of the analysed sample. The instrument can simulta- 
neously measure data from multiple isotopes from the same region. Data included 
in this study are derived from both the NanoSims prototype (ONERA, Université 
Paris Sud/Cameca) and a second-generation instrument (NanoSims 50L, 
Cameca). The second-generation instrument is fully automated and has the capa- 
city to quantify seven masses simultaneously. 

Quantitative image data extraction. For each pixel, the number of counts of a 
given secondary ion is stored in a computer file. The data are reconstructed into a 
greyscale image in which the pixel intensity is derived from the total number of 
counts of a given secondary ion within the area representing a given pixel. The 
lateral resolution is dependent on factors, including the beam size and the number 
of pixels per image acquisition area. A series of images based on the analysis of 
successive atomic layers can be summed. The incorporation of halogenated 
nucleotide analogues (BrdU, iododeoxyuridine) is assessed by directly quantifying 
the ionized halogen atoms (°'Br, 1277), which are found at extremely low back- 
ground concentrations in most tissues. Halogen enrichment within a given region 
of interest is expressed as the percent enrichment over natural background. 

Hue saturation intensity ratio images. The localization of tracers enriched with 
rare stable isotopes (for example, '"N-thymidine) is done by deriving quantitative 
ratio images. The ratio of a rare stable isotope and the common stable isotope is 
quantified on a pixel by pixel basis. Incorporation of a stable isotope within a 
region of interest is determined when the ratio of the rare stable isotope to the 
common stable isotope exceeds the natural ratio. Because quantitative images 
acquired with MIMS generate far more data than can be represented by a grey- 
scale, we previously developed a method of visually representing ratio data based 
on a hue saturation intensity (HSI) transformation. With an HSI image, the scale 
can be set such that the lower bounds of the scale (blue colours) are assigned to 
pixels in which the corresponding rare-to-common stable isotope ratio is equi- 
valent to the natural ratio. An important advantage of an HSI image is that the 
scale can be extended or compressed to provide an optimal colour representation 
of regions of differential stable isotope enrichment. However, any image manip- 
ulations, such as those resulting from scale changes, do not modify the quantitative 
data underlying the colour representation. 

Data processing and export. MIMS images are viewed and processed using Open 
MIMS software, a custom-built plugin to the ImageJ software (http://www.nrims. 
harvard.edu/software.php). In the experiments contained herein, nuclei were 
manually identified as regions of interest within the crypts of the small intestine 
using a combination of '*N and *’P mass images. Cellular identity was assigned by 
an observer unaware of the labelling status of the cells. Paneth cells were identified 
on the basis of their characteristic cytoplasmic granules seen in '“N and *’S images, 
large round nuclei seen in '*N and *'P images, and their anatomic location at or 
near the base of the crypt (Supplementary Fig. 9). *N-labelled Paneth cells (Fig. 2 
and Supplementary Fig. 8 Experiment 1) had a number of granules with a median 
of 8 (interquartile range = 5-11, n = 218). The granules were not separated from 
the nucleus by a plasma membrane, which would have been detected by MIMS. 
CBC cells were identified on the basis of their location between Paneth cells at the 
base of the crypt and their narrow, elongated nuclei. The +4 position was assigned 
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to the first non-Paneth cell superior to the Paneth zone at the base of the crypt. 
Cells in the +4 position and above (transit amplifying cells) typically had elliptical 
nuclei. 

*°N-labelling is expressed as the '°N:'4N ratio (percentage above natural ratio) 
obtained using the following equation: 


(5N :!4N ratio) — (natural ratio) 


natural ratio 


N :14N(% above natural ratio) x 100 


Mosaic images. The maximum field size analysed in this study was 80 um X 80 
uum. The presentation of images from larger tissue swaths is made possible by the 
construction of mosaic images in which a series of adjacent fields are analysed. 
After MIMS acquisition, the full series of tiles is assembled to form a larger mosaic 
image. 

Statistics. Data were analysed using JMP 8.0.1 and Prism 3.0 (GraphPad). Two- 
way t-tests were performed for comparisons of two experimental groups. For 
analyses of multiple experimental groups, one-way ANOVA and Bonferroni 
correction were used. Non-parametric data were analysed with a Kruskal- 
Wallis test and Dunn’s multiple comparisons test. Binomial testing was used to 
compare observed versus predicted event frequencies. 

Reagents. '°N-thymidine and '*C-thymidine were obtained from Cambridge 
Isotopes. '°N-thymidine, used for human administration, was subjected to addi- 
tional endotoxin and microbial limits testing before packaging. '*C-palmitate was 
obtained from Sigma-Aldrich. Bromodeoxyuridine (BrdU) and iododeoxyuridine 
(IdU) were obtained from Sigma-Aldrich. 

Cell culture. Human foreskin fibroblasts were grown in DMEM/Ham’s F12 sup- 
plemented with 10% FBS, 1% penicillin/streptomycin/L-glutamine. For in vitro 
labelling experiments, fibroblasts were grown on silicon chips. Prior to MIMS 
analysis, cells were fixed and spun dry (Headway Research Spinner, PWM32) at 
3,500 r.p.m. for 2 min. 

Mice. C57B6 mice were used in accordance with the Guide for the Use and Care of 
Laboratory Animals and approved by the Harvard Medical School Standing 
Committee on Animals. Osmotic minipumps (Alzet) were implanted subcuta- 
neously. Jejunal segments were fixed (4% paraformaldehyde) and embedded (LR 
white). Sections (0.5 um) were mounted on silicon chips. 

Drosophila. Drosophila larvae were fed a standard yeast/cornmeal diet** supple- 
mented with 1% w/v '%C-labelled palmitic acid-1 (Sigma-Aldrich). For chase 
experiments, larvae were collected at 72h of development, following a label-free 
chase period of 0-24h, using standard food supplemented with 1% unlabelled 
palmitic acid (Sigma-Aldrich). Following dissection, anterior midguts or fat bodies 
were fixed in 2% formaldehyde, post-fixed in 2% glutaraldehyde then 2% osmium 
tetroxide, embedded in Epon resin, sectioned (thickness = 2 um), and mounted 
on silicon chips. 

Human protocol. The Partners Institutional Review Board approved the 
protocol. After obtaining informed consent, the subject was admitted to the 
Brigham and Women’s Hospital clinical research unit. Prior to administration, 
N-thymidine was resuspended in 500ml 0.9% NaCl and passed through a 
0.2 um filter. The subject received '°N-thymidine intravenously (30 mg bolus over 
5 min, then 15mg h’! for 48 h) without adverse events. Peripheral blood was 
collected in an EDTA-containing tube at the conclusion of the infusion and after 
a 4-week chase. The blood sample was centrifuged (300g, 5 min), the buffy coat 
collected, and red blood cells lysed (ACK buffer, Invitrogen). The white blood cell 
suspension was smeared onto silicon chips and fixed with 4% paraformaldehyde. 
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Most patients with acute myeloid leukaemia (AML) die from pro- 
gressive disease after relapse, which is associated with clonal evolu- 
tion at the cytogenetic level'’. To determine the mutational 
spectrum associated with relapse, we sequenced the primary tumour 
and relapse genomes from eight AML patients, and validated 
hundreds of somatic mutations using deep sequencing; this allowed 
us to define clonality and clonal evolution patterns precisely at 
relapse. In addition to discovering novel, recurrently mutated genes 
(for example, WAC, SMC3, DIS3, DDX41 and DAXX) in AML, we 
also found two major clonal evolution patterns during AML relapse: 
(1) the founding clone in the primary tumour gained mutations and 
evolved into the relapse clone, or (2) a subclone of the founding 
clone survived initial therapy, gained additional mutations and 
expanded at relapse. In all cases, chemotherapy failed to eradicate 
the founding clone. The comparison of relapse-specific versus 
primary tumour mutations in all eight cases revealed an increase 
in transversions, probably due to DNA damage caused by cytotoxic 
chemotherapy. These data demonstrate that AML relapse is asso- 
ciated with the addition of new mutations and clonal evolution, 
which is shaped, in part, by the chemotherapy that the patients 
receive to establish and maintain remissions. 

To investigate the genetic changes associated with AML relapse, and 
to determine whether clonal evolution contributes to relapse, we per- 
formed whole-genome sequencing of primary tumour-relapse pairs 
and matched skin samples from eight patients, including unique patient 
identifier (UPN) 933124, whose primary tumour mutations were previ- 
ously reported’. Informed consent explicit for whole-genome sequen- 
cing was obtained for all patients on a protocol approved by the 
Washington University Medical School Institutional Review Board. 
We obtained >25X haploid coverage and >97% diploid coverage for 
each sample (Supplementary Table 1 and Supplementary Information). 
These patients were from five different French-American-British 
haematological subtypes, with elapsed times of 235-961 days between 
initial diagnosis and relapse (Supplementary Table 2a, b). 

Candidate somatic events in the primary tumour and relapse genomes 
were identified** and selected for hybridization capture-based validation 
using methods described in Supplementary Information. Deep sequen- 
cing of the captured target DNAs from skin (the matched normal tissue), 
primary tumour and relapse tumour specimens® (Supplementary 
Table 3) yielded a median of 590-fold coverage per site. The average 
number of mutations and structural variants was 539 (range 118- 
1,292) per case (Fig. 1a). 

The general approach for relapse analysis is exemplified by the first 
sequenced case (UPN 933124). A total of 413 somatic events from tiers 


1 to 3 were validated (see ref. 7 for tier designations; Supplementary 
Fig. la and Supplementary Tables 4a and 5). Of these, 78 mutations 
were relapse-specific (63 point mutations, 1 dinucleotide mutation, 13 
indels and 1 translocation; relapse-specific criteria described in Sup- 
plementary Information and shown in Supplementary Fig. 1b), 5 point 
mutations were primary-tumour-specific, and 330 (317 point mutations 
and 13 indels) were shared between the primary tumour and relapse 
samples (Fig. 1a, b and Supplementary Fig. 2). The skin sample was 
contaminated with leukaemic cells for this case (peripheral white blood 
cell count was 105,000 cells mm? when the skin sample was banked), 
with an estimated tumour content in the skin sample of 29% 
(Supplementary Information). In addition to the ten somatic non- 
synonymous mutations originally reported for the primary tumour 
sample’, we identified one deletion that was not detected in the original 
analysis (DNMT3A L723fs (ref. 8)) and three mis-sense mutations previ- 
ously misclassified as germline events (SMC3 G662C, PDXDC1 E421K 
and TTN E14263K) (Fig. 1b, Table 1 and Supplementary Table 4b). 

A total of 169 tier 1 coding mutations (approximately 21 per case) 
were identified in the eight patients (Table 1 and Supplementary 
Tables 4b and 6), of which 19 were relapse-specific. In addition to 
mutations in known AML genes such as DNMT3A (ref. 8), FLT3 
(ref. 9), NPM1 (ref. 10), IDHI1 (ref. 7), IDH2 (ref. 11), WT (ref. 12), 
RUNX1 (refs 13, 14), PTPRT (ref. 3), PHF6 (ref. 15) and ETV6 (ref. 16) 
in these eight patients, we also discovered novel, recurring mutations 
in WAC, SMC3, DIS3, DDX41 and DAXX using 200 AML cases whose 
exomes were sequenced as part of the Cancer Genome Atlas AML 
project (Table 1, Supplementary Table 4b and Supplementary Fig. 3; 
T.J.L., R-K.W. and The Cancer Genome Atlas working group on AML, 
unpublished data). Details regarding the novel, recurrently mutated 
genes are provided in Table 1, Supplementary Tables 4b and 7 and 
Supplementary Figs 3 and 4. Structural and functional analyses of 
structural variants are presented in the Supplementary Information 
(Supplementary Figs 5-10 and Supplementary Tables 2, 8 and 9). 

The generation of high-depth sequencing data allowed us to quantify 
accurately mutant allele frequencies in all cases, permitting estimation 
of the size of tumour clonal populations in each AML sample. On the 
basis of mutation clustering results, we inferred the identity of four 
clones having distinct sets of mutations (clusters) in the primary 
tumour of AML1/UPN 933124 (Supplementary Information). The 
median mutant allele frequencies in the primary tumour for clusters 
1 to 4 were 46.86%, 24.89%, 16.00% and 2.39%, respectively (Fig. 1b and 
Supplementary Table 5c). Clone 1 is the ‘founding’ clone (that is, the 
other subclones are derived from it), containing the cluster 1 mutations; 
assuming that nearly all of these mutations are heterozygous, they must 
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Figure 1 | Somatic mutations quantified by deep sequencing of capture 
validation targets in eight acute myeloid leukaemia primary tumour and 
relapse pairs. a, Summary of tier 1-3 mutations detected in eight cases (not 
including translocations). All mutations shown were validated using capture 
followed by deep sequencing. Shared mutations are in grey, primary tumour- 
specific mutations in blue and relapse-specific mutations in red. The total 
number of tier 1-3 mutations for each case is shown above the light-grey 
rectangle. b, Mutant allele frequency distribution of validated mutations from 
tier 1-3 in the primary tumour and relapse of case UPN 933124 (left). Mutant 
allele frequencies for five primary-tumour-specific mutations were obtained 
from a 454 deep read-count experiment. Four mutation clusters were identified 
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in the primary tumour, and one was found at relapse. Five low-level mutations 
in both the primary tumour and relapse (including four residing in known copy 
number variable regions) were excluded from the clustering analysis. Non- 
synonymous mutations from genes that are recurrently mutated in AML are 
shown. The change of mutant allele frequencies for mutations from the five 
clusters is shown (right) between the primary tumour and relapse. The orange 
and red lines are superimposed. c, The mutation clusters detected in the 
primary tumour and relapse samples from seven additional AML patients. The 
relationship between clusters in the primary tumour and relapse samples are 
indicated by lines linking them. 


Table 1 | Coding mutations identified in eight primary tumour-relapse pairs 


UPN Total tier 1 mutations Recurrently mutated genes Relapse-specific non-synonymous 
(primary/relapse) in primary tumour somatic mutations 

452198 9/9 DNMT3A, NPM1, FLT3, IDH1 None 
573988 6/8 NPM1, IDH2 STOX2 
804168 22/26 FLT3, WT1, PHF6, FAMS5C, TTC39A SLC25A12, RIPK4, ABCD2 
933124 14/17 DNMT3A, NPM1, FLT3, TTN, SMC3, PTPRT ETV6*, MYO18B*, WAC*+, STK4 
400220 12/13 FLT3, RUNX1, WT1, PLEKHH1 None 
426980 32/35 IDH2, MYO1F, DDX4 GBP4, DCLK1, IDH2*, DCLK1*, ZNF260 
758168 15/19 DNAHSY, DIS3, CNTN5, PML-RARAt ENSGO0000180144, DAGLA* 
869586 51/50 RUNX1, WT1, TTN, PHF6, NF1, SUZ12, NCOA7, EED, DAXX, ACSS3, WAC, NUMA1 None 


Tier 1 mutation counts exclude RNA genes. 
* Recurrent mutations occurring in relapse sample. 
+ Translocations were not included in tier 1 mutation counts. 
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be present in virtually all the tumour cells at presentation and at relapse, 
as the variant frequency of these mutations is ~40-50%. Clone 2 (with 
cluster 2 mutations) and clone 3 (with cluster 3 mutations) must be 
derived from clone 1, because virtually all the cells in the sample contain 
the cluster 1 mutations (Fig. 2a). Itis likely that a single cell from clone 3 
gained a set of mutations (cluster 4) to form clone 4: these survived 
chemotherapy and evolved to become the dominant clone at relapse. 
We do not know whether any of the cluster 4 mutations conferred 
chemotherapy resistance; although none had translational consequences, 
we cannot rule out a relevant regulatory mutation in this cluster. 
Assuming that all the mutations detected are heterozygous in the 
primary tumour sample (with a malignant cellular content at 93.72% 
for the primary bone marrow sample, see Supplementary Informa- 
tion), we were able to calculate the fraction of total malignant cells in 
each clone. Clone 1 is the founding clone; 12.74% of the tumour cells 
contain only this set of mutations. Clones 2, 3 and 4 evolved from clone 
1. The additional mutations in clones 2 and 3 may have provided a 
growth or survival advantage, as 53.12% and 29.04% of the tumour 
cells belonged to these clones, respectively. Only 5.10% of the tumour 
cells were from clone 4, indicating that it may have arisen last (Fig. 2a). 
However, the relapse clone evolved from clone 4. A single clone 


a Clonal fractions at initial diagnosis 


«HSCs 
P * DNMTSA, NPM1, FLT3, PTPRT, SMC3 

om 

AML’ 
Cell type: Mutations: 
®@ Normal ea o Founding (cluster 1) , 
@ Primary specific (cluster 2) @ Relapse enriched (cluster 4) 

b 


Model 1 (UPNs 400220, 573988, 804168) 


Chemotherapy 


Model 2 (UPNs 426980, 452198, 758168, 869586, 933124) 


Be 
»eed" 


Chemotherapy 


Figure 2 | Graphical representation of clonal evolution from the primary 
tumour to relapse in UPN 933124, and patterns of tumour evolution 
observed in eight primary tumour and relapse pairs. a, The founding clone in 
the primary tumour in UPN 933124 contained somatic mutations in 
DNMT3<A, NPM1, PTPRT, SMC3 and FLT3 that are all recurrent in AML and 
probably relevant for pathogenesis; one subclone within the founding clone 
evolved to become the dominant clone at relapse by acquiring additional 
mutations, including recurrent mutations in ETV6 and MYO18B, and a 
WNK1-WAC fusion gene. HSC, haematopoietic stem cell. b, Examples of the 
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containing all of the cluster 5 mutations was detected in the relapse 
sample; clone 5 evolved from clone 4, but gained 78 new somatic 
alterations after sampling at day 170. As all mutations in clone 5 appear 
to be present in all relapse tumour cells, we suspect that one or more of 
the mutations in this clone provided a strong selective advantage that 
contributed to relapse. The ETV6 mutation, the MYO18B mutation, 
and/or the WNK1-WAC fusion are the most likely candidates, as 
ETV6, MYO18B and WAC are recurrently mutated in AML. 

We evaluated the mutation clusters in the seven additional primary 
tumour-relapse pairs by assessing peaks of allele frequency using 
kernel density estimation (Supplementary Fig. 11 and Supplemen- 
tary Information). We thus inferred the numbers and malignant frac- 
tions of clones in each primary tumour and relapse sample. Similar to 
UPN 933124, multiple mutation clusters (2-4) were present in each of 
the primary tumours from four patients (UPN 869586, UPN 426980, 
UPN 452198 and UPN 758168). However, only one major cluster was 
detected in each of the primary tumours from the three other patients 
(UPN 804168, UPN 573988 and UPN 400220) (Fig. 1c and Sup- 
plementary Table 10). Importantly, all eight patients gained relapse- 
specific mutations, although the number of clusters in the relapse 
samples varied (Fig. 1). 


Day 170 First relapse 


» ETV6, WNK1-WAC, 


MyYO18B 


Chemotherapy 


® Pathogenic mutations 


AML40/UPN804168 


Relapse variant frequency (%) 


020° 40° 60-80 100 
Tumour variant frequency (%) 


Relapse variant frequency (%) 


oO al 40 80 100 
Tumour variant frequency (%) 


two major patterns of tumour evolution in AML. Model 1 shows the dominant 
clone in the primary tumour evolving into the relapse clone by gaining relapse- 
specific mutations; this pattern was identified in three primary tumour and 
relapse pairs (UPN 804168, UPN 573988 and UPN 400220). Model 2 shows a 
minor clone carrying the vast majority of the primary tumour mutations 
survived and expanded at relapse. This pattern was observed in five primary 
tumour and relapse pairs (UPN 933124, UPN 452198, UPN 758168, UPN 
426980 and UPN 869586). 
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Two major patterns of clonal evolution were detected at relapse 
(Fig. 2b and Supplementary Fig. 3): in cases with pattern 1, the dominant 
clone in the primary tumour gained additional mutations and evolved 
into the relapse clone (UPN 804168, UPN 573988 and UPN 400220). 
These patients may simply be inadequately treated (for example, elderly 
patients who cannot tolerate aggressive consolidation, like UPN 
573988), or they may have mutations in their founding clones (or 
germline variants) that make these cells more resistant to therapy 
(UPN 804168 and UPN 400220). In patients with pattern 2, a minor 
subclone carrying the vast majority (but not all) of the primary tumour 
mutations survived, gained mutations, and expanded at relapse; a sub- 
set of primary tumour mutations was often eradicated by therapy, and 
were not detected at relapse (UPN 758168, UPN 933124, UPN 452198, 
UPN 426980 and UPN 869586). Specific mutations in a key subclone 
may contribute to chemotherapy resistance, or the mutations import- 
ant for relapse may be acquired during tumour evolution, or both. 
Notably, in cases 426980 and 758168, a second primary tumour clone 
survived chemotherapy and was also present at relapse (Fig. 1c and 
Supplementary Fig. 3). Owing to current technical limits in our ability 
to detect mutations in rare cells (mostly related to currently achievable 
levels of coverage with whole genome sequencing), our models repres- 
ent a minimal estimate of the clonal heterogeneity in AML. 

All eight patients received cytarabine and anthracycline for induc- 
tion therapy, and additional cytotoxic chemotherapy for consolidation; 
treatment histories are summarized in Supplementary Table 2 and 
described in Supplementary Information. To investigate the potential 
impact of treatment on relapse mutation types, we compared the six 
classes of transition and transversion mutations in the primary tumour 
with the relapse-specific mutations in all eight patients (Fig. 3a). 
Although CeG->TeA transitions are the most common mutations 
found in both primary and relapse AML genomes, their frequencies 
are significantly different between the primary tumour mutations 
(51.1%) and relapse-specific mutations (40.5%) (P= 2.99 x 10°”). 
Moreover, we observed an average of 4.5%, 5.3% and 4.2% increase 
in AeT+>CeG (P=9.13 X10’), CeG>AsT (P=0.00312) and 
CeG—GeC (P=0.00366) transversions, respectively, in relapse- 
specific mutations. Notably, an increased AeT—CeG transversion rate 
has also been observed in cases of chronic lymphocytic leukaemia with 
mutated immunoglobulin genes'’. CeG—>AeT transversions are the 
most common mutation in lung cancer patients who were exposed to 
tobacco-borne carcinogens'® (Fig. 3b and Supplementary Table 11). We 
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Figure 3 | Comparison of mutational classes between primary tumours and 
relapse samples. a, Fraction of the primary tumour and relapse-specific 
mutations in each of the transition and transversion categories. b, Transversion 
frequencies of the primary tumour and relapse-specific mutations from eight 
AML tumour and relapse pairs. 456 relapse-specific mutations and 3,590 
primary tumour mutations from eight cases were used for assessing statistical 
significance using proportion tests. 
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examined the 456 relapse-specific mutations and 3,590 primary tumour 
point mutations from all eight cases as a group, and found that the 
transversion frequency is significantly higher for relapse-specific 
mutations (46%) than for primary tumour mutations (30.7%) 
(P=3.71X 10"), indicating that chemotherapy has a substantial 
effect on the mutational spectrum at relapse. Similar results were 
obtained when we limited the analysis to the 213 mutations that had 
0% variant frequency in the primary tumour samples (Supplementary 
Fig. 1b); the transversion frequency for relapse-specific mutations was 
50.4%, versus 31.4% for primary tumour samples (P = 3.89 X 10). 
Very few copy-number alterations were detected in the eight relapse 
samples, suggesting that the increased transversion rate is not asso- 
ciated with generalized genomic instability (Supplementary Fig. 12). 

We first described the use of deep sequencing to define precisely the 
variant allele frequencies of the mutations in the AML genome of case 
933124 (ref. 3), and here have refined and extended this technique to 
examine clonal evolution at relapse. The analysis of eight primary 
AMLand relapse pairs has revealed unequivocal evidence fora common 
origin of tumour subpopulations; a dominant mutation cluster repre- 
senting the founding clone was discovered in the primary tumour 
sample in all cases. The relationship of the founding clone (and sub- 
clones thereof) to the ‘leukaemia initiating cell’ is not yet clear— 
purification of clonal populations and functional testing would be 
required to establish this relationship. We observed the loss of primary 
tumour subclones at relapse in four of eight cases, suggesting that some 
subclones are indeed eradicated by therapy (Figs 1 and 2 and Sup- 
plementary Fig. 3). Some mutations gained at relapse may alter the 
growth properties of AML cells, or confer resistance to additional 
chemotherapy. Regardless, each tumour displayed clear evidence of 
clonal evolution at relapse and a higher frequency of transversions that 
were probably induced by DNA damage from chemotherapy. 
Although chemotherapy is required to induce initial remissions in 
AML patients, our data also raise the possibility that it contributes to 
relapse by generating new mutations in the founding clone or one of its 
subclones, which then can undergo selection and clonal expansion. 
These data demonstrate the critical need to identify the disease-causing 
mutations for AML, so that targeted therapies can be developed that 
avoid the use of cytotoxic drugs, many of which are mutagens. 

This study extends the findings of previous studies'?*', which 
recently described patterns of clonal evolution in ALL patients using 
fluorescence in situ hybridization and/or copy number alterations 
detected by SNP arrays, and it enhances the understanding of genetic 
changes acquired during disease progression, as previously described 
for breast and pancreatic cancer metastases****. Our data provide 
complementary information on clonal evolution in AML, using a 
much larger set of mutations that were quantified with deep sequen- 
cing; this provides an unprecedented number of events that can be 
used to define precisely clonal size and mutational evolution at relapse. 
Both ALL and AML share common features of clonal heterogeneity at 
presentation followed by dynamic clonal evolution at relapse, includ- 
ing the addition of new mutations that may be relevant for relapse 
pathogenesis. Clonal evolution can also occur after allogeneic trans- 
plantation (for example, loss of mismatched HLA alleles via a uni- 
parental disomy mechanism), demonstrating that the type of therapy 
itself can affect clonal evolution at relapse*®”’. Taken together, these 
data demonstrate that AML cells routinely acquire a small number of 
additional mutations at relapse, and suggest that some of these muta- 
tions may contribute to clonal selection and chemotherapy resistance. 
The AML genome in an individual patient is clearly a ‘moving target’; 
eradication of the founding clone and all of its subclones will be 
required to achieve cures. 


METHODS SUMMARY 


Illumina paired-end reads were aligned to NCBI build36 using BWA 0.5.5 (http:// 
sourceforge.net/projects/bio-bwa/). Somatic mutations were identified using 
SomaticSniper’* and a modified version of the SAMtools indel caller. Structural 
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variations were identified using BreakDancer’. All predicted non-repetitive so- 
matic SNVs, indels and all structural variants were included on custom sequence 
capture arrays from Roche Nimblegen. Illumina 2 X 100-bp paired-end sequen- 
cing reads were produced after elution from capture arrays. VarScan® and a read 
remapping strategy using Crossmatch (P. Green, unpublished data) and BWA 
were used for determining the validation status of predicted SNVs, indels and 
structural variants. A complete description of the materials and methods is pro- 
vided in Supplementary Information. All sequence variants for the AML tumour 
samples from eight cases have been submitted to dbGaP (accession number 
phs000159.v4.p2). 
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Galectin 8 targets damaged vesicles for autophagy to 
defend cells against bacterial invasion 


Teresa L. M. Thurston!, Michal P. Wandel', Natalia von Muhlinen!, Agnes Foeglein' & Felix Randow! 


Autophagy defends the mammalian cytosol against bacterial infec- 
tion’ °. Efficient pathogen engulfment is mediated by cargo-selecting 
autophagy adaptors that rely on unidentified pattern-recognition or 
danger receptors to label invading pathogens as autophagy cargo, 
typically by polyubiquitin coating*°. Here we show in human cells 
that galectin 8 (also known as LGALSS8), a cytosolic lectin, is a danger 
receptor that restricts Salmonella proliferation. Galectin 8 monitors 
endosomal and lysosomal integrity and detects bacterial invasion by 
binding host glycans exposed on damaged Salmonella-containing 
vacuoles. By recruiting NDP52 (also known as CALCOCO2), 
galectin 8 activates antibacterial autophagy. Galectin-8-dependent 
recruitment of NDP52 to Salmonella-containing vesicles is transient 
and followed by ubiquitin-dependent NDP52 recruitment. Because 
galectin 8 also detects sterile damage to endosomes or lysosomes, as 
well as invasion by Listeria or Shigella, we suggest that galectin 8 
serves as a versatile receptor for vesicle-damaging pathogens. Our 
results illustrate how cells deploy the danger receptor galectin 8 to 
combat infection by monitoring endosomal and lysosomal integrity 
on the basis of the specific lack of complex carbohydrates in the 
cytosol. 

Galectins are B-galactoside-binding lectins that accumulate in the 
cytosol before being secreted via a leader-peptide-independent path- 
way’”"'. The best-characterized functions of galectins are performed 
extracellularly, where they bind glycans to modulate cellular behaviour. 
However, the occurrence of galectins in the cytosol, which under 
physiological conditions is devoid of complex carbohydrates, makes 
them prime candidates for a role as danger and/or pattern-recognition 
receptors. Galectin 3 (also known as LGALS3) accumulates on damaged 
bacteria-containing vesicles, although the functional consequences of 
its recruitment remain unknown'””’. We screened a panel of human 
galectins for their ability to detect invasion by Salmonella enterica 
serovar Typhimurium. At 1h post-infection (p.i.), galectin 3, 8 and 9 
accumulated on about 10% of intracellular S. Typhimurium (Fig. la, b 
and Supplementary Fig. 1a), of which 90% were associated with LAMP1 
(Supplementary Fig. 1b). Recruitment of galectins peaked between 1h 
and 2h p.i. (Supplementary Fig. 1c). As galectin 3, 8 and 9 were 
recruited to Salmonella-containing vesicles (SCVs), we used short inter- 
fering RNAs (siRNAs) to test whether their depletion causes hyperpro- 
liferation of S. Typhimurium. Cells lacking galectin 8 or NDP52, but not 
galectin 3 and/or 9, failed to suppress proliferation of S. Typhimurium 
(Fig. 1c and Supplementary Figs 2a—c and 3a). Microscopic analysis 
confirmed that the greater bacterial burden of cells lacking galectin 8 
was caused by enhanced proliferation rather than differential uptake of 
bacteria (Supplementary Fig. 3b). Hyperproliferating bacteria in cells 
lacking galectin 8 appeared mainly in a LAMP1-negative compartment 
(Supplementary Fig. 3c), consistent with colonization of the cytosol. We 
conclude that galectin 8 is an antibacterial restriction factor. 

As autophagy provides antibacterial protection to cells, the decora- 
tion of SCVs with galectins might be an autophagy-inducing signal, 
analogous to ubiquitin coating’*’*. We therefore tested binding of 
galectins to autophagy receptors that restrict the proliferation of S. 
Typhimurium, that is, NDP52, p62 and optineurin’’. We found in a 


luminescence-based mammalian interactome mapping (LUMIER) 
assay that galectin 8 and NDP52 interacted specifically (Fig. 2a and 
Supplementary Fig. 4a). Binding was confirmed by precipitating 
endogenous NDP352 with Flag-tagged galectin 8 (Fig. 2b). 

SCVs double labelled by endogenous galectin 8 and NDP52 were 
prominent in Salmonella-infected HeLa cells (Fig. 2c). In cells expres- 
sing yellow fluorescent protein (YFP)-tagged galectins the majority of 
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Figure 1 | Galectin 8 responds to infection by S. Typhimurium and restricts 
bacterial proliferation. a, b, Analysis of HeLa cells stably expressing YFP fused 
to the indicated galectins and infected with S. Typhimurium for 1h. 

a, Percentage of bacteria coated by the indicated galectins. YFP-positive 
bacteria were counted by microscopy. Mean and standard deviation (s.d.) of 
triplicate HeLa cultures, n > 100 bacteria per coverslip. b, Confocal 
micrographs. Arrowheads, bacteria shown in insets. DAPI, 4',6-diamidino-2- 
phenylindole. c, Kinetics of fold replication for S. Typhimurium in HeLa cells 
transfected with the indicated siRNAs. Bacteria were counted on the basis of 
their ability to form colonies on agar plates. Mean and s.d. of triplicate HeLa 
cultures and duplicate colony counts. siRNAs are further characterized in 
Supplementary Fig. 2a—c. **P < 0.01, Student’s t-test. Scale bar, 10 jim. 
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Figure 2 | Galectin 8 binds NDP52. a, LUMIER binding assay: normalized 
ratio between luciferase activity bound to beads and present in lysates. Lysates 
of 293ET cells expressing NDP52, p62 or optineurin each fused to luciferase, 
and the indicated Flag-tagged galectins were incubated with anti-Flag beads. 
Flag-tagged proteins are further characterized in Supplementary Fig. 4a. 

b, Lysates of 293ET cells, expressing Flag-tagged proteins as indicated, were 
immunoprecipitated with anti-Flag beads. Lysates and immunoprecipitates 
(IP) were blotted for the presence of Flag-tagged proteins and endogenous 


galectin-positive SCVs had accumulated NDP52 (Fig. 2d and 
Supplementary Fig. 5a). Furthermore, at 1h p.i. NDP52 and galectin 
8 co-localized tightly in a pattern distinct from p62 or ubiquitin 
‘microdomains’'® (Supplementary Fig. 5b, c). 

To characterize further the interaction between galectin 8 and 
NDP352 we determined their respective binding sites. Galectin 8 con- 
tains two carbohydrate-recognition domains (CRD) (Supplementary 
Fig. 6a). NDP52 bound galectin-8, 12 (that is, amino acids 229-360), 
equivalent to the second CRD, but not galectin-8)_23 (that is, amino 
acids 1-228) (Supplementary Fig. 6b). NDP52 harbours a SKICH 
domain, a coiled coils-forming region, and a ubiquitin-binding zinc 
finger (Supplementary Fig. 6a). Galectin 8 bound NDP52,_393 but not 
NDP52)_370 (Supplementary Fig. 6c). The NDP52 fragment spanning 
residues 370-393 is therefore essential for binding galectin 8. This 
fragment, as well as NDP5237 3g, purified as GST-fusion proteins, 
bound galectin 8 (Supplementary Fig. 6d). A point mutation within 
NDP52372-380 (L374A) abrogated binding to galectin 8, without com- 
promising binding to ubiquitin when introduced into full-length 
NDP352 (Supplementary Fig. 6d, e). Binding of galectin 8 to NDP52 
is direct, as the purified proteins interacted (Supplementary Fig. 6f). 

To determine whether one monomer of the NDP52-galectin-8 
heteromeric complex recruits the other partner, the accumulation of 
galectins on SCVs in cells depleted of NDP52 or TBK1 was analysed 
(Fig. 3a and Supplementary Fig. 2). Galectin 3, 8 and 9 re-distributed 
normally to SCVs in siRNA-treated cells. In contrast, in cells depleted 
of galectin 8 NDP52 did not localize to SCVs at 1h p.i., a phenotype 
that was complemented upon expression of siRNA-resistant galectin 8 
(Fig. 3b and Supplementary Fig. 7). Cells lacking galectin 3 or galectin 9 
had no defect in recruiting NDP52 to SCVs. The recruitment of 
NDP352 to SCVs is therefore specifically mediated by galectin 8, while 
NDP52 and TBK1 are dispensable for the accumulation of galectins on 
SCVs. 

Rupture of SCVs exposes the cytosol to host glycans and microbial 
carbohydrates, either or both of which may cause galectin 8 accumula- 
tion at SCVs. The requirement for carbohydrate binding by galectin 
8 was tested using point mutations in either CRD’. In contrast to 
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NDP52. c, Confocal images of HeLa cells infected with S. Typhimurium for 1h 
and stained with antisera against NDP52 and galectin 8. Arrowheads, bacteria 
shown in insets. d, Co-localization of NDP52 with galectin-positive bacteria in 
HeLa cells stably expressing YFP fused to the indicated galectins, infected with 
S. Typhimurium and stained with NDP52 antiserum 1h after infection. Mean 
and s.d. of duplicate HeLa cultures, n > 100 bacteria per coverslip, 
representative of two independent experiments. Scale bar, 10 jum. 


galectin-8(R232H), galectin-8(R69H) did not accumulate at SCVs, 
showing that the amino-terminal CRD is required for carbohydrate- 
dependent recruitment of galectin 8 to SCVs (Fig. 3c). To test whether 
the carbohydrates detected by galectin 8 are of microbial origin, bind- 
ing of recombinant galectin 8 to bacteria in vitro was analysed. Galectin 
8 did not bind to S. Typhimurium but stained blood-group-B-positive 
bacteria (Escherichia coli strain O86)" (Fig. 3d), suggesting that galectin 
8, when accumulating on SCVs, recognizes host glycans. The occur- 
rence of galectin-8 ligands in host cells was confirmed by staining HeLa 
cells with recombinant galectin 8 (Fig. 3d). Direct evidence that host 
glycans recruit galectins to SCVs was obtained from experiments with 
CHO-Lec3.2.8.1 cells!®, which lack mature glycans and in which recruit- 
ment of galectins to SCVs was severely impaired (Fig. 3e). The detection 
of host glycans on damaged vesicles by galectin 8 suggests that it is not a 
receptor specific for S. Typhimurium. We therefore tested whether 
sterile damage to vesicles is detected by galectins. Osmotic damage of 
endosomes induced dense puncta formed by galectin 3, 8 and 9 but not 
by galectin 1 (Fig. 3f and Supplementary Fig. 8). Damage to lysosomes 
by glycyl-L-phenylalanine 2-naphthylamide (GPN) treatment resulted 
in the initial loss of lysotracker staining, followed by the appearance of 
galectin 3, 8 and 9 speckles (Supplementary Fig. 9a). In contrast to 
damaged SCVs and endosomes, burst lysosomes were also detected 
by galectin 1, suggesting compartment-specific differences in the dis- 
tribution of galectin ligands. GPN failed to induce speckles of galectin- 
8(R69H) (Supplementary Fig. 9b), thereby indicating that binding of 
glycans to the N-terminal CRD of galectin 8 is required to detect lyso- 
somal damage. The capacity of galectin 3, 8 and 9 to detect vesicle 
damage by binding exposed host glycans suggests their ability to sense 
the invasion of cells by a wide range of vesicle-damaging pathogens. 
Indeed, galectin 3, 8 and 9 also accumulated around Gram-positive 
Listeria monocytogenes and Gram-negative Shigella flexneri (Sup- 
plementary Fig. 10), proving that these galectins detect the invasion 
of cells by phylogenetically distant bacteria. We conclude that galectin 
3, 8 and 9 are danger receptors that sense the exposure of host glycans 
on ruptured membranes and thereby monitor the integrity of the 
endosomal/lysosomal compartment. 
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Figure 3 | Galectin 8 is a danger receptor that senses cytosolic host glycans 
and recruits NDP52 to restrict Salmonella proliferation. a, Percentage of S. 
Typhimurium coated by the indicated galectins. HeLa cells stably expressing 
YFP-tagged galectins were treated with the indicated siRNAs. YFP-positive 
bacteria were counted by microscopy at 1h p.i. siRNAs are further 
characterized in Supplementary Fig. 2. b, Analysis of HeLa cells treated with the 
indicated siRNAs and stained with NDP52 antiserum. NDP52-positive bacteria 
were counted by microscopy 1h after infection with S. Typhimurium. siRNAs 
are further characterized in Supplementary Fig. 2. c, Percentage of bacteria 
coated by the indicated galectin 8 alleles. HeLa cells stably expressing the 
indicated galectin 8 alleles fused to YFP were infected with S. Typhimurium. 
YFP-positive S. Typhimurium were counted by microscopy at 75 min p.i. WT, 
wild type. d, Binding of galectin 8 to bacteria and HeLa cells. The indicated 
bacteria and HeLa cells were incubated with His-GST-ubiquitin (Ub), His- 
GST-galectin 8 or buffer as indicated, followed by murine anti-His antibody 


To test whether the recruitment of NDP52 to SCVs is essential for 
the antibacterial function of galectin 8, we depleted cells of galectin 8 
and targeted NDP52 artificially to SCVs by fusing it to galectin 3 
(Fig. 3g and Supplementary Fig. 4b). S. Typhimurium hyperprolifer- 
ated in galectin-8-depleted cells, while expression of galectin 3 fused to 
NDP352 restored the cells’ restrictive capacity. Artificial targeting of 
NDP52 to SCVs therefore compensates for the lack of galectin 8, which 
strongly suggests that the recruitment of NDP52 via galectin 8 is 
essential to efficiently antagonize bacterial invasion. 

NDP32 restricts the proliferation of S. Typhimurium by targeting 
bacteria for autophagy*. As galectin 8 recruits NDP52 to SCVs, we 
investigated whether galectin 8 is required upstream of NDP52 for the 
induction of antibacterial autophagy. First, we confirmed that at Lh 
p.i., S. Typhimurium that had been sensed by galectin-8, and therefore 
had acquired an NDP52 coat, were taken up into LC3-positive 
autophagosomes (Fig. 4a). Such an outcome was predicted from the 
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and PE-labelled anti-mouse serum. e, Percentage of S. Typhimurium coated by 
the indicated galectins. Wild-type CHO cells and mutant Lec3.2.8.1 cells stably 
expressing YFP-tagged galectins were infected with S. Typhimurium. YFP- 
positive bacteria were counted by microscopy at 1h p.i. f, Confocal images of 
HeLa cells expressing the indicated YFP-tagged galectins. Cells were left 
untreated or were exposed to hypertonic conditions, with or without (w/o) PEG 
as indicated, followed by hypotonic shock. g, Fold replication of S. 
Typhimurium in HeLa cells expressing the indicated galectin 3 variants and 
transfected with the indicated siRNAs. At 2h and 6h after infection, cells were 
lysed and bacteria counted on the basis of their ability to form colonies on agar 
plates. Galectin 3 proteins are further characterized in Supplementary Fig. 4b. 
Mean and s.d. of duplicate coverslips (a—c, e) or triplicate HeLa cultures and 
duplicate colony counts (g). >100 bacteria counted per coverslip. Data are 
representative of at least two repeats. *P < 0.05, Student’s f-test. Scale bar, 

10 um. 


pairwise co-localization of NDP52 with galectin-8- and LC3-positive 
bacteria (Fig. 2d and Supplementary Fig. 11). We then tested whether 
depletion of galectin 8 impairs autophagy of S. Typhimurium. In the 
absence of galectin 8 fewer bacteria were targeted by LC3 (Fig. 4b) and 
of the remaining LC3-positive bacteria fewer had accumulated NDP52 
(Supplementary Fig. 11). In contrast, galectin-8 recruitment to SCVs 
did not require autophagy as it occurred undisturbed in ATGS /~ 
fibroblasts (Supplementary Fig. 12). We conclude that the danger 
receptor galectin 8, by recruiting NDP52, directs autophagy towards 
invading bacteria. 

The recruitment of NDP52 to invading bacteria is mediated by two 
signals, the newly discovered carbohydrate-dependent galectin-8 
pathway and the previously known ubiquitin-dependent pathway’. 
Their differential contribution to the recruitment of NDP52 to 
S. Typhimurium was investigated by analysing NDP52 mutants selec- 
tively disabled to bind galectin 8 and/or ubiquitin. For accurate scoring, 
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Figure 4 | The antibacterial effect of galectin 8 is mediated by autophagy. 
a, Confocal micrograph of HeLa cells expressing GFP-LC3 and mCherry- 
galectin-8, stained for NDP52 1h after infection with S. Typhimurium. The 
lower right panel contains a fluorescence line scan along the yellow line in the 
merge inset. Arrowheads, bacteria shown in insets. b, Percentage of GEP-LC3- 
positive S. Typhimurium at 1h p.i. in HeLa cells treated with the indicated 
siRNAs. c, Percentage of bacteria positive for NDP52. HeLa cells expressing the 
indicated NDP52 variants fused to YFP were infected with S. Typhimurium. 
d, Percentage of bacteria positive for the indicated markers. HeLa cells, either 
wild type or expressing YFP-galectin-8 as indicated, were infected with S. 
Typhimurium. Ubiquitin was detected by antibody staining. e, HeLa cells 
treated with the indicated siRNAs were infected with S. Typhimurium and 
stained for NDP52 at the indicated time points. Fluorescent bacteria were 
counted by microscopy at the indicated time points. Graphs, representing at 
least two independent repeats, show mean and s.d. of duplicate coverslips for 
which >200 bacteria were counted. *P < 0.05, Student’s t-test. Scale bar, 10 um. 


NDP52ASKICH was used, as this truncated allele is distributed 
diffusely throughout the cytosol. NDP52127_445 associated with SCVs 
at all time points investigated (Fig. 4c). Deleting the carboxy-terminal 
ubiquitin-binding zinc finger (NDP52)57 429) impaired the recruit- 
ment of NDP352 to S. Typhimurium at late but not early time points. 
In contrast, NDP5297_446(L374A), which lacks affinity for galectin 8 
but not ubiquitin (Supplementary Fig. 6e), did not co-localize with 
bacteria at 1h p.i. but accumulated progressively over time (Fig. 4c). 
NDP32 is therefore recruited to SCVs in two phases—an early transient 
surge driven by galectin 8 and a later wave dependent on ubiquitin. The 
kinetics of galectin 8 and ubiquitin recruitment to SCVs support this 
model as SCVs are marked by galectin 8 only at early time points, 
whereas ubiquitin marks persist (Fig. 4d). Direct evidence for early 
galectin-8-dependent and late galectin-8-independent recruitment of 
NDP32 to S. Typhimurium was obtained from cells depleted of galectin 
8, in which NDP52 and bacteria co-localized at 4h but not at 1h p.i. 
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(Fig. 4e). NDP52}27-420(L374A), deficient in binding to ubiquitin and 
galectin 8, did not translocate to SCVs at any time point tested (Fig. 4c). 
Taken together, NDP52 relocates to SCVs in response to two signals, 
which are active against bacteria at different stages of invasion. The 
early response to invading bacteria requires the galectin-8-dependent 
pathway, whereas the zinc-finger-dependent pathway dominates at 
later time points. 

The galectin-8/NDP52 pathway sheds light on why most intracellular 
bacteria avoid the cytosol and prefer vesicular compartments. The 
cytosol seems to be protected by synergistic layers of antibacterial 
defence that activate autophagy at distinct steps of the invasion process. 
An early line of defence comprises the accumulation of diacylglycerol 
on bacteria-containing vesicles, which subsequently become the target 
of autophagy”. Bacteria escaping the diacylglycerol pathway and expos- 
ing host glycans on their damaged vacuoles are targeted by galectin 8 
and NDP52, as described in this work. A third layer of defence coats 
invading bacteria with polyubiquitin®°”®. Neither the enzymatic 
machinery for nor the substrate of ubiquitylation have been identified, 
although LRSAM1, a RING-finger E3 ubiquitin ligase, contributes to 
autophagy of S. Typhimurium’. Peptidoglycan and septin cages sur- 
rounding cytosolic bacteria also contribute to autophagy **. Defects in 
this intricate network of autophagy-inducing defence pathways are 
likely to cause susceptibility to infection and promote inflammation, 
for example in Crohn’s disease**”’. Galectin 8 is positioned strategically 
at the cellular entry point for a variety of pathogens and is therefore 
expected to have shaped pathogen evolution. 


METHODS SUMMARY 


Galectins were cloned as YFP fusions and transduced into HeLa cells. HeLa cells 
were infected with S. Typhimurium strain 12023. For confocal microscopy cells 
were fixed in paraformaldehyde. Bacterial growth was assessed by a gentamycin 
protection assay. Knockdowns were accomplished with Stealth siRNAs. LUMIER 
assays were performed as described*’. For flow-cytometric analysis samples were 
incubated with lysates of E. coli expressing His-GST fusion proteins, followed by 
anti-His antibody and goat anti-mouse serum. Statistical testing was performed 
using two-tailed Student's ¢-test. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Antibodies. Antibodies were from QIAGEN (Penta-His), the Developmental 
Studies Hybridoma Bank (LAMP1), BD Transduction Laboratories (p62), Santa 
Cruz (GAL8-H80, TBK1-C100), R&D Systems (galectin 8), Transduction 
Laboratories (NDP52, for western blots), Enzo Life Science (ubiquitin FK2), Sigma 
(ATGS, Flag M2), Dabco (HRP-conjugated reagents), Jackson ImmunoResearch 
Laboratories (goat anti-mouse-phycoerythrin (PE)) and Invitrogen (Alexa- 
conjugated anti-mouse and anti-rabbit antisera). The antiserum against NDP52 
used for immunofluorescence was a gift from J. Kendrick-Jones. 

Plasmids. M5P or closely related plasmids were used to produce recombinant 
MLV for the expression of proteins in mammalian cells*'. pETM plasmids were 
gifts from A. Geerlof. Open reading frames encoding human galectins, NDP52, 
p62, optineurin, ubiquitin, ATG5 and LC3C were amplified by PCR or have been 
described***. Mutations were generated by PCR and verified by sequencing. 
Bacteria. S. Typhimurium (strain 12023), provided by D. Holden, was grown 
overnight in Luria broth (LB) and sub-cultured (1:33) in fresh LB for 3.5 h before 
infection. HeLa cells in 24-well plates were infected with 20 ul of such cultures for 
15 min at 37°C. Following two washes with warm PBS and an incubation with 
100 ppg ml! gentamycin for 2h cells were cultured in 20 pg ml! gentamycin. To 
enumerate intracellular bacteria, cells from triplicate wells were lysed in 1 ml cold 
PBS containing 0.1% Triton-X-100. Serial dilutions were plated in duplicate on 
TYE agar. 

S. flexneri M90T, provided by C. Tang, was grown overnight in Tryptic Soy 
Broth (TSB) and sub-cultured (1:100) in fresh TSB for 2h before infection. 
Bacteria were resuspended in warm IMDM and 100 pl were added to HeLa cells 
in 24-well plates. Samples were centrifuged for 10 min at 670g. Following incuba- 
tion at 37 °C for 30 min, cells were washed with warm PBS and cultured in 100 pg 
ml gentamycin for 2h and 20 pg ml ' thereafter. 

L. monocytogenes strain EGD (BUG 600), provided by P. Cossart, was grown 
overnight in Brain Heart Infusion (BHI) at 30°C with shaking. Five-hundred 
microlitres of diluted cultures (1:333) were added to HeLa cells in 24-well plates, 
which were centrifuged at 670g for 10 min. Cells were incubated for 1h at 37 °C, 
washed with warm PBS and cultured in media supplemented with 100 1g ml’ 
gentamycin for the next hour and 20 1g ml’ gentamycin thereafter. 

Cell culture. Cells were grown in IMDM supplemented with 10% FCS at 37 °C in 
5% CO>. HeLa cells were obtained from the European Collection of Cell Cultures, 
CHO and Lec3.2.8.1 cells'* were obtained from from P. Stanley, ATG5 ’~ MEFs*3 
from N. Mizushima. 

RNA interference. 5 X 10° cells per well were seeded in 24-well plates. The fol- 
lowing day, cells were transfected with 40 pmol of siRNA (Invitrogen) using 
Lipofectamine 2000 (Invitrogen) in Optimem medium (Invitrogen). Optimem 
was replaced with complete IMDM medium after 4h and experiments were per- 
formed after 3 days. siRNAs targeted the following sequences: siNDP52 
5'-UUCAGUUGAAGCAGCUCUGUCUCCC’; siGAL8 #36 5’-CCCACGCCUG 
AAUAUUAAAGCAUUU; siGAL8 #38 5'-GGACAAAUUCCAGGUGGCUGU 
AAAU; siGAL3 #669 5’-AAGCCCAAUGCAAACAGAAUUGCUU; siGAL3 
#670 5'’-GAGAACAACAGGAGAGUCAUUGUUU; siGAL9 #807 5'-GGCUU 
CAGUGGAAAUGACAUUGCCU; siGAL9 #809 5’-UGUGCAACACGAGGC 
AGAACGGAGG; siTBK1 5’-GACAGAAGUUGUGAUCACA(TT)*. 

To render galectin 8 resistant to siGAL8 #38, silent mutations (underlined) 
GGATAAGTTTCAAGTCGCAGTTAAT were introduced by PCR and con- 
firmed by sequencing. 

Immunoprecipitation and western blot. Post-nuclear supernatants from 2 X 
10° HeLa cells expressing Flag-tagged proteins were obtained following lysis 


(150mM NaCl, 0.1% Triton-X-100, 20mM Tris-HCl (pH 7.4), 5mM EDTA 
and proteinase inhibitors). Protein complexes were immunoprecipitated for 2h 
with Flag agarose before washing. Samples were eluted with Flag peptide and 
separated on 4-12% denaturing Bis-Tris gels (Invitrogen). Visualization following 
immunoblotting was performed using ECL detection reagents (Amersham 
Bioscience). 

LUMIER assays. LUMIER binding assays*®** with pairs of putative interactors, 
one fused to luciferase and the other fused to GST or Flag, were performed in 
LUMIER lysis buffer (150 mM NaCl, 0.1% Triton-X-100, 20 mM Tris-HCl (pH 
7A), 5% glycerol, 5mM EDTA and proteinase inhibitors). GST-fusion proteins 
were immobilized on beads before incubation with the luciferase tagged binding 
partner for 2 h. For Flag-based assays, both proteins were expressed in 293ET cells 
and immobilized using Flag-agarose. After washing in lysis buffer, proteins were 
eluted with glutathione or Flag peptide in Renilla lysis buffer (Promega). Relative 
luciferase activity represents the ratio of activity eluted from beads and present in 
lysates. 

FACS. To examine the binding of galectin 8, bacteria in stationary phase or HeLa 
cells were washed in PBSF (PBS, 2% FCS) and incubated for 30 min at 4°C with 
cleared lysates of E. coli expressing His-GST fusion proteins, followed by incuba- 
tions with anti-His antibody and PE-conjugated goat anti-mouse serum. Bacteria 
were fixed in 4% paraformaldehyde before analysis. 

Sterile damage to vesicles. Endosomes were lysed by exposing cells for 10 min to 
hypertonic medium (0.5 M sucrose in PBS, with or without 10% PEG1000), fol- 
lowed by two PBS washes and an incubation in 60% PBS for 3 min’*. Cells were 
returned to complete medium for 20 min, before being fixed in paraformaldehyde. 
For live imaging of lysosomal damage, cells were labelled for 1h with 100nM 
LysoTracker Red (Invitrogen), washed with PBS, incubated in Leibovitz L15 
medium and, after acquisition of the first image, exposed to 333 uM GPN*’. 
Microscopy. HeLa cells were grown on glass cover slips before infection. After 
infection, cells were washed twice with warm PBS and fixed in 4% paraformaldehyde 
in PBS for 30 min. Cells were washed twice in PBS and then quenched with PBS pH 
7.4 containing 1 M glycine and 0.1% Triton-X-100 for 30 min before blocking for 
30 min in PBTB (PBS, 0.1% Triton-X-100, 2% BSA). Cover slips were incubated 
with primary followed by secondary antibodies for 1 h in PBTB before being mount- 
ing in medium containing DAPI (Vector Laboratories). At least 100 events per slide 
were scored in quantitative assays. Confocal images were taken with a X63, 1.4 
numerical aperture objective on either a Zeiss 710 or a Zeiss 780 microscope. Live 
imaging was performed on a Nikon Eclipse Ti equipped with an Andor Revolution 
XD system and a Yokogawa CSU-X1 spinning disk unit. 
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Multi-isotope imaging mass spectrometry reveals 
slow protein turnover in hair-cell stereocilia 
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Haydn M. Prosser’, James M. Ervasti?, David P. Corey’ & Claude P. Lechene* 


Hair cells of the inner ear are not normally replaced during an 
animal’s life, and must continually renew components of their 
various organelles’. Among these are the stereocilia, each with a 
core of several hundred actin filaments that arise from their apical 
surfaces and that bear the mechanotransduction apparatus at their 
tips. Actin turnover in stereocilia has previously been studied” by 
transfecting neonatal rat hair cells in culture with a B-actin-GFP 
fusion, and evidence was found that actin is replaced, from the top 
down, in 2-3 days. Overexpression of the actin-binding protein 
espin causes elongation of stereocilia within 12-24 hours, also 
suggesting rapid regulation of stereocilia lengths’. Similarly, the 
mechanosensory ‘tip links’ are replaced in 5-10 hours after cleav- 
age in chicken and mammalian hair cells**. In contrast, turnover in 
chick stereocilia in vivo is much slower®. It might be that only 
certain components of stereocilia turn over quickly, that rapid 
turnover occurs only in neonatal animals, only in culture, or only 
in response to a challenge like breakage or actin overexpression. 
Here we quantify protein turnover by feeding animals with a '*N- 
labelled precursor amino acid and using multi-isotope imaging 
mass spectrometry to measure appearance of new protein. 
Surprisingly, in adult frogs and mice and in neonatal mice, in vivo 
and in vitro, the stereocilia were remarkably stable, incorporating 
newly synthesized protein at <10% per day. Only stereocilia tips 
had rapid turnover and no treadmilling was observed. Other 
methods confirmed this: in hair cells expressing f-actin-GFP we 
bleached fiducial lines across hair bundles, but they did not 
move in 6 days. When we stopped expression of B- or y-actin with 
tamoxifen-inducible recombination, neither actin isoform left the 
stereocilia, except at the tips. Thus, rapid turnover in stereocilia 
occurs only at the tips and not by a treadmilling process. 

To understand protein turnover in the inner ear, we sought a 
method that could reveal new synthesis with high spatial resolution, 
in adult animals, in vivo, and without transfection of cells. We fed frogs 
with food containing L-leucine tagged with the stable isotope '°N for 
up to 32 days, and killed them by cardiac perfusion. As leucine is an 
essential amino acid, newly synthesized protein would contain '°N in 
nearly the same proportion as the food. To locate '"N-tagged protein, 
plastic sections of frog saccules were placed onto silicon wafers or 
electron microscopy grids for multi-isotope imaging mass spectro- 
metry (MIMS) imaging. We detected nitrogen as the CN ion, 
with the MIMS detectors set for the masses of '*C, °C, '*C'“N and 
12C¢15N, and calculated the !°N/'N ratio at each location in a field”® 
(see Methods). 

A mass-26 ('*C'*N) image of the saccular macula, representing total 
protein, shows the apical surface ofa hair cell with its hair bundle (Fig. 1a, 
left). Lateral resolution approaches 30nm, and depth resolution is 
~2nm (ref. 7). The mass-27 channel ('*C'°N; not shown) revealed 
newly synthesized protein, and the mass-27/mass-26 ratio indicated 


per cent incorporation. A ratio image (Fig. la, right, 32 days of feeding) 
shows that the stereocilia incorporate new protein more slowly than 
the cell body. We quantified turnover by summing the mass-26 and 
mass-27 counts for all pixels in each region of interest (ROI) and 
calculated per cent incorporation (Fig. 1c). Hair-cell cytoplasm and 
cuticular plates incorporated new protein at ~0.5% per day. But 
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Figure 1 | Incorporation of '°N into frog saccular epithelium. a, Left, mass- 
26 image for a hair cell (HC) and supporting cells (SC) at day 32. CP, cuticular 
plate; HB, hair bundle; OM, otolithic membrane. Right, image of mass-27/ 
mass-26 ratio, revealing incorporation of '°N. Colour scale represents 0-30% 
incorporation. Field is 20 X 20 tum. b, Top, mass-26 image of the shortest 
stereocilium in a. Bottom, profile of incorporation along its length. c, Turnover 
for days 1, 2, 4, 8, 21 and 32. cutic, cuticular plate; cyto, cytoplasm. Data are 
shown as mean + standard error of the mean (s.e.m.); N was typically 8-10. 
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stereocilia incorporated new protein at <0.3% per day, far lower than 
the 50% per day suggested for actin in mammalian cochlea’. One 
difference is that MIMS assesses total protein; perhaps actin turns over 
rapidly but most other proteins are very stable. However, actin is 50- 
60% of the total protein in stereocilia®'® and would account for the 
majority of the MIMS signal. 

Surprisingly, incorporation seemed higher at the very tips of stereocilia 
(Fig. 1b). Incorporation at the tip almost equalled that of cytoplasm 
(Fig. 1c). In this bundle, incorporation at the tips of all stereocilia was 
12.5%, compared to 6.1% in the shafts. We also measured incorpora- 
tion in individual ROIs along the length of a stereocilium. Of 37 frog 
stereocilia, there was no correlation with height in 31 and positive 
correlation in just one, inconsistent with movement of the ISN label 
from top to bottom. 

Protein turnover in bullfrog stereocilia was much slower than that 
inferred for mouse and rat hair cells*"', leading us to wonder whether 
turnover is more rapid in mammals. We fed adult mice '*N-leucine 
and killed them after 1, 2, 8, 32, 56 or 150 days. In the mouse vestibular 
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Figure 2 | Incorporation of !N into adult mouse hair cells. a, Mass-26 image 
of utricle; day 56. b, Mass-27/mass-26 ratio: low incorporation in stereocilia. 

c, Projection of a three-dimensional stack of a. d, Ratio image from e: high 
turnover at tips. e, Incorporation, days 1-56. cutic, cuticular plate; cyto, 
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system, '°N incorporation was faster in the cytoplasm of hair cells and 
supporting cells (Fig. 2a, b, e; ~7% per day initially) but still slow in 
stereocilia (<3% per day initially). This was not a peculiarity of actin- 
based structures: intestinal-brush-border microvilli showed more 
incorporation than the cytoplasm, and much more than stereocilia 
(Supplementary Fig. 2). 

The MIMS instrument acquires repeated images, slowly etching the 
surface to create thousands of XY planes with a Z resolution of approxi- 
mately 1 nm allowing the creation of a three-dimensional representa- 
tion (Fig. 2c, d and Supplementary Movies 1 and 2). In mouse utricle, 
incorporation was again much higher in the tips than elsewhere in 
stereocilia. Although tip labelling would be expected in the initial phase 
of treadmilling, the high-turnover region never extended much below 
the top micrometre, which apparently represents a fixed zone that does 
not move down by treadmilling. 

We then wondered whether turnover is more rapid in cochlear than 
vestibular hair cells. Apparently not: at 8, 32, 56 and 150 days (Fig. 2f-j), 
cochlear stereocilia had slower incorporation of new protein (~2% 
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cytoplasm; HC, hair cell; SC, supporting cell. f, Cochlear inner hair cell; day 56; 
mass 26. g, Ratio image. h, Three-dimensional reconstruction from 450 images. 
i, Incorporation along axis of stereocilia from h. Bar: plotted length. Line widths: 
95% confidence. j, Incorporation after 8, 32, 56 or 150 days. Colour scale 0-100%. 
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initially) than did cell bodies or cuticular plates (~3.5% initially). 
Moreover, adult stereocilia apparently have a pool of protein that is 
replaced exceedingly slowly: '°N incorporation after 5 months was 
<60%. 

A three-dimensional image of cochlear hair bundles (Fig. 2h and 
Supplementary Movie 3) also showed high turnover in the distal 
micrometres of the sterocilia. At 56 days, oN incorporation was 40% 
along most of the length of individual stereocilia, but >80% at the tips 
(Fig. 2i). Except for the tips, we did not see a gradient of incorporation. 
In addition, the resolution of MIMS revealed higher turnover in the 
region of the cell membrane than in stereocilia cores (Fig. 2i, inset), 
perhaps reflecting rapid diffusion of some membrane proteins’. 

Elsewhere in the cochlea, ‘°N incorporation varied considerably 
(Supplementary Fig. 1). Like the frog otolithic membrane, the mouse 
tectorial membrane was exceptionally stable, with ~10% '°N incorp- 
oration in 5 months. Pillar cell shafts, containing stiff microtubule 
bundles, had just 23% incorporation. The reticular lamina, comprising 
the actin-rich cuticular plates of hair cells, terminal webs of Dieter’s 
cells and their zonulae adherens, showed low turnover. Thus, low 
turnover occurred in structures that convey sound to stereocilia. 

Protein turnover could be more rapid in neonatal mice. To test this, 
mice were moved at birth to surrogate mothers that had been fed '°N for 
24-28 days to raise the '°N content of their milk; pups were nursed for 
0-15 days before being killed. In a second group, mothers were fed '"N 
before and during pregnancy. At birth, pups were moved to 
control nurse mothers, and protein turnover was measured as 15N loss. 
Pups nursed by '°N mothers showed a normal rise in '°N in utricular 
cytoplasm (Supplementary Fig. 3), with 30-50% incorporation in 4 
days. In stereocilia, protein turnover was slower, with 25% in 4 days 
(Supplementary Fig. 3b, g). A three-dimensional rendering also showed 
low turnover in stereocilia, except at the tips (Supplementary Fig. 3c). 

A difficulty in interpretation is that about half the hair bundles in the 
mouse utricle develop postnatally”’, so incorporation of new protein in 
the stereocilia might reflect development rather than turnover. Indeed, 
small bundles, presumably still developing, showed high incorporation 
(Supplementary Fig. 4), and were therefore excluded from the analysis 
in Supplementary Fig. 3g. Another difficulty is the assumption that 
tagged leucine is incorporated equally into proteins. Proteins with a 
lower abundance of leucine than the average (9.36%) would have a 
lower '°N/14N ratio, which might be interpreted as slow turnover. In 
fact, stereocilia proteins do have less leucine. The ten most abundant 
proteins of neonatal rat utricular stereocilia (P. Gillespie, personal 
communication), adjusted for abundance, have 8.53% leucine. This 
is not enough to account for low stereocilia incorporation, but we 
sought a different way to measure turnover. 

We therefore measured the loss of '°N in pups born to '°N-fed 
mothers but nursed by controls, normalizing '°N levels in each struc- 
ture to that at postnatal day (P)0 (Supplementary Fig. 3h). Consistent 
with some postnatal bundle development, small bundles had little or no 
label, but full-sized hair bundles retained !°N for many days, with 70% 
remaining at P4 (Supplementary Fig. 3d—f, h and Supplementary Fig. 4). 

Perhaps turnover in stereocilia is more rapid in vitro. We cultured 
utricles from PO pups and added '°N-leucine after one day. Over four 
more days, hair bundles showed relatively low '°N incorporation (~8% 
per day versus 12-16% per day in cytoplasm; Supplementary Fig. 5). 

Perhaps the fluorescence previously observed’ was due to newly 
synthesized B-actin-GFP that was only a small fraction of the actin 
in the core—in essence a tracer for treadmilling, the bulk of which was 
driven by recycling of actin monomers that are only slowly replaced. 
To assess treadmilling with an alternative to MIMS, and with steady 
rather than transient expression of a tracer, we constructed mouse 
lines expressing f-actin-GFP under the myosin-7a promoter 
(Supplementary Fig. 6). Stereocilia were brightly fluorescent (Fig. 3). 
We dissected utricles at ages PO-P5 and cultured them for 1 day. Ina 
confocal microscope, we selected a field in which the epithelium was 
tilted sideways, and bleached the GFP in a 1-1m-wide line mostly 


LETTER 


perpendicular to the bundle axis, for 5-8 bundles (Fig. 3b, c). A 
three-dimensional confocal stack was acquired, and the utricle placed 
back into culture. Two, four and six days later, each utricle was returned 
for imaging (Fig. 3d). The bleached line was sometimes obscured if the 
bundle axis changed in culture, but the three-dimensional image could 
be used to reconstruct it. In all experiments, we saw no significant 
movement of the bleached line (Fig. 3f). On average, the line moved 
by 2 + 3% of the bundle height in 6 days (Fig. 3g). 

It is possible that our cultures were not healthy, and that treadmill- 
ing was impaired. However, in the same utricles, we observed the 
appearance and growth of new bundles. Moreover, both bleached 
and unbleached hair cells took up the fluorescent dye RH414, a marker 
for functional transduction (data not shown). Thus an independent 
method gave no evidence of treadmilling on a timescale of days. 

The GFP tag could have inhibited treadmilling in Myo7a::}-actin:GFP 
mice. We therefore studied treadmilling by terminating B- or y-actin 
expression and following the redistribution of actin with antibodies. 
Mice expressing a tamoxifen-inducible Cre under the B-actin promoter 
were crossed with mice carrying either a floxed B- or y-actin gene™. At 3 
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Figure 3 | Tracking treadmilling by bleaching GFP-tagged f-actin. 

a, Neonatal utricular macula in culture, hair cells expressed B-actin-GFP. 
Bleached bundles are indicated by asterisks. b, Projection of confocal planes 
before and after bleaching. D1, day 1. c, Three-dimensional stack rotated to 
show oblique bleached line. d, Same bundle on days (D)3, 5 and 7. A shift in 
bundle orientation partially obscured the line. e, Rotation of the day-7 stack 
revealed the bleached line. f, Line positions for three bundles, as a proportion of 
bundle height. g, Average movement of bleached lines, as a proportion of 
bundle height. N = 24; mean + s.e.m. 
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weeks of age, after bundles had developed, mice were given tamoxifen to 
promote excision. One, three, eighteen or thirty-four weeks later, we fixed 
and removed cochleas, and used antibodies specific for B- and y-actin. 

In control mice without tamoxifen, B- and y-actin were uniformly 
distributed (Fig. 4a). If there is rapid treadmilling in adults, B-actin 
protein should be lost from stereocilia soon after excision prevented 
B-actin gene expression. Similarly, y-actin should be lost in mice with a 
floxed y-actin gene. This occurred, but only for a 0.4-lm region at 
stereocilia tips (Fig. 4a, b). The rest of the stereocilium retained the 
deleted actin isoform, and the region of isoform loss did not progress 
towards stereocilia bases, even after 34 weeks. 

The MIMS method provides a new way to study protein stability in the 
inner ear’. MIMS is based on Schoenheimer’s pioneering use of stable 
isotopes to reveal the dynamics of cellular components’, but adds imaging 
with exceptional spatial resolution. '°N incorporation can be calculated 
with an accuracy that depends only on acquisition time. It does not rely 
on tagging any one protein species, or transfecting a specific cell type. 

In contrast to previous studies’, MIMS revealed unexpectedly low 
turnover of protein in hair-cell stereocilia. Incorporation of '°N in 
stereocilia occurred at <0.3% per day in frog, <2% per day in adult 
mouse and <9% per day in neonatal mice, incompatible with tread- 
milling in 2-3 days. By bleaching fluorescently labelled actin in bun- 
dles, we saw no significant movement of the actin. Finally, excision of 
the B- and y-actin genes in mature mice showed loss of the excised 
actin from the stereocilia tips, but not a progressive loss from tip to 
base. We found no evidence for treadmilling of stereocilia cores, over 
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Figure 4 | Loss of actin from tips following knockout of actin genes. 

a, b, Adult (3-week-old) cochlear inner hair cells, labelled with antibodies to 
B-actin (green) or y-actin (red). a, Floxed f-actin mouse crossed to Cagg- 
CreER, without tamoxifen (control), or 1, 3 or 34 weeks (wk) after tamoxifen. 
Loss of B-actin occurred only at tips of stereocilia. In control mice, a few hair 
cells displayed loss of f-actin at tips without tamoxifen, suggesting a leaky 
promoter. b, Floxed y-actin mouse 18 weeks after tamoxifen. Loss of y-actin 
occurred only at tips of stereocilia. 
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weeks or even months. It is possible that the previously observed 
treadmilling was produced by overexpression of B-actin, if the exo- 
genous actin significantly exceeds endogenous actin. Or—at least in 
the vestibular system—addition of actin to hair bundles developing 
postnatally might have confused interpretation. 

Nevertheless, there is a conspicuously dynamic turnover of protein 
at the tips of stereocilia, in the distal 0.3-0.5 um. This could represent 
turnover of transduction components, which can be replaced in hours 
when damaged*”. It could represent movement of proteins to the tips 
by hair-cell myosins, most of which climb from bases to tips. Or it may 
reflect a rapid regulation of actin filaments, which can change length 
by ~0.1 1m when tip links are cut’. If so, there might be both poly- 
merization and depolymerization at the barbed end. Rapid but local 
regulation of filament length is not inconsistent with the limited 
growth of stereocilia induced by overexpression of espin’. 

In long-lived vertebrates, the actin of stereocilia must be replaced. It 
may be that treadmilling does occur, but at a timescale far longer than 
that studied here. It may instead be that individual actin filaments do 
not extend the length of the stereocilium—that free ends can accept or 
shed monomers and that unsynchronized treadmilling occurs on a 
submicroscopic scale. Unlike filopodia, stereocilia seem to develop 
once, elaborating an intricate staircase of heights that persists for the 
life of an animal; in most of their length, they then replace individual 
proteins slowly and without disrupting this exquisite structure. 


METHODS SUMMARY 

Tissue preparation and '°N feeding. Adult bullfrogs (Rana catesbeiana) were 
maintained on a !°N-leucine diet, then fixed and treated with EDTA to dissolve 
otoconia before embedding and sectioning. 

Adult mice (Mus musculus, CD1 strain) were maintained on a ‘°N-leucine diet 
before being killed and processed. For neonatal mice, mothers were maintained on 
'°N food before and during pregnancy; at birth, the pups were moved to a control 
surrogate mother. A second set was born to control mothers and moved at birth to 
a surrogate mother that had been maintained on '°N food. 

Utricles were dissected, cultured overnight in DMEM-F12 with 1 or 6% FBS'° 
and maintained in '°N-enriched medium for up to 5 days. 

MIMS analysis. MIMS was performed as described previously’. 'N incorpora- 
tion was calculated by comparing counts for masses 27 and 26 to the natural 
abundance of '°N (0.37%) and the relative abundance of '°N in the food (1-2%). 
Photobleaching of cultured mouse utricles. We constructed mice expressing a 
B-actin-GFP fusion under the myosin-7a promoter, integrated in the HPRT locus 
on the X chromosome (see Supplementary Fig. 6). Utricles from neonatal pups 
(mosaic for the transgene) were cultured as above. Hair bundles were bleached on 
day 1; images were taken on days 3, 5 and 7. 

Conditional deletion of B- and y-actin. Actg#” and Act" mice, in which exons 
3 and 4 were flanked by loxP sites'* were crossed to mice expressing Cre/Esr1 
recombinase driven by the chicken B-actin promoter coupled to the cytomegalo- 
virus immediate-early enhancer. Mice were dosed with tamoxifen to induce 
recombination. Anti-y-actin antibody clone 1-37 (ref. 15) IgG was conjugated 
to Alexa fluorescent dyes. FITC-labelled anti-f-actin antibodies (clone AC-15) 
were obtained from Abcam. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Tissue preparation and '°N feeding. Adult bullfrogs (Rana catesbeiana) were fed 
an unlabelled control diet for 2-3 days, and were transferred at day 0 toa °N-rich 
diet. Food pellets contained '°N-labelled leucine so that the '"N content was 
1.28%. Bullfrogs were fixed via cardiac perfusion with 4% formaldehyde + 0.1% 
glutaraldehyde. After decapitation, the saccules were exposed and the head and 
carcass stored in 4% formaldehyde + 1% glutaraldehyde. Heads were transferred 
to PBS before dissection of the saccules. Saccules were then treated with EDTA for 
2-10 days to dissolve otoconia before embedding in Epon and sectioning. 

Adult mice (Mus musculus, CD1 strain) ate control food that contained no 
added 1°N, and then were switched to food with '°N-leucine (either 1.28 or 
1.23%) for up to 150 days. Neonatal mice were labelled in two ways. For one 
set, mothers were fed !°N food for 4-8 days, mated, and continued on IN food 
until pups were delivered ('°N mother). At birth, the pups were moved to a 
surrogate mother that had been maintained on control food (‘4N mother). A 
second set was born to '*N mothers and moved at birth to a surrogate mother 
that had been maintained for 24-28 days and was continued on '°N food. Pups 
were nursed by surrogate mothers for 0, 2, 4, 8 or 15 days before being killed and 
fixed. At least two animals (four inner ears) were prepared for each condition. 

Cultured utricles were dissected by standard methods’’. After removing the 
otolithic membranes, the maculae were plated on a sterile portion of ACLAR film 
(EMS) and incubated in DMEM-F12 containing 100g ml * carbenicillin, with 
either 1 or 6% FBS. After an overnight incubation, the medium was replaced with 
15N enriched medium (DN, Cambridge Isotopes, 2.44 or 1.35% TN), The cultures 
were then incubated at 37 °C for up to 5 days until fixation. Tissues were fixed in 
4% formaldehyde + 0.1% glutaraldehyde in cacodylate buffer for 15 min at 22 °C, 
then overnight at 4 °C. Cultures were then moved to cacodylate buffer and held for 
2-3 days at 4°C, before processing in batches for plastic embedding, as above. 
MIMS analysis. In the multi-isotope imaging mass spectrometer’, a ~30nm 
beam of Cs* ions, scanned across the sample, sputters atoms, atomic clusters or 
molecules from the surface, of which a small fraction are ionized. Negative 
secondary ions are accelerated back, then shaped with ion optics and analysed 
with a double sector mass spectrometer employing four detectors, each tuned to a 
specific mass. Because N> is not charged, '°N and '*N were detected as cyanide 
anions, '7C!°N~ and '*C'*N_, mass 27 and mass 26, respectively, and their ratio 
was calculated for each point in the raster scan. 

We calculated the proportion of newly synthesized protein, for each pixel of the 
field, from the ratio of counts in the '*C’°N and *CN images according to 
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where N57 and Ny are the counts in each pixel for masses 27 and 26, A, is the 
natural abundance of '°N (0.37%), and Ar is the relative abundance of '°N in the 
food (1-2%). Incorporation was displayed with a hue saturation intensity trans- 
formation (HSI) in which the hue corresponds to the ratio value, and the intensity 
is an index of statistical reliability*. Image analysis performed in Image]/ 
OpenMIMS (NIH/NRIMS). 3D reconstructions made in ImageVis3D (NIH/ 
NCRR SCI/CIBC). 

Photobleaching of cultured mouse utricles. To label endogenous actin, we con- 
structed mice expressing a B-actin-GFP fusion under the myosin-7a promoter, 


integrated in the HPRT locus on the X chromosome. Correct integration of the 
BAC was confirmed Southern blot using flanking probes. Details of the construc- 
tion, using methods previously described”, are in Supplementary Fig. 6. 

Neonatal pups (heterozygous and therefore mosaic for the transgene) were 

killed between PO and P4, their utricles dissected and cultured as above. 
Dissection was performed on experimental day 0, then stereocilia were bleached 
on day 1 and images were taken on experimental days 3, 5 and 7. Each imaging day, 
the cells were transferred to DMEM-F12 without phenol red and then to the stage 
of an Olympus FV1000 confocal (upright configuration, LUMFL X60 immersion 
objective, 1.1 NA) with a warmed chamber. On day 1, a Z-stack of the selected field 
containing 20-50 hair cells was collected using confocal imaging of the GFP with a 
488 nm laser. Target hair bundles were selected and bleached with one to three 
300-ms flashes of a focused 405 nm laser at 20 ps pixel” '. Each selected bundle in 
the field was separately bleached. After bleaching, another Z-stack for the whole 
field was collected to confirm the bleached locations. The following experimental 
days one Z-stack per field was acquired. Movement of the bleached line was 
assessed by rotating the image stack to observe the line relative to the full length 
of the bundle. 
Conditional deletion of B- and y-actin. The Actg#”** and Actb"* mice, in which 
exons 3 and 4 were flanked by loxP sites'* were crossed to transgenic mice expres- 
sing Cre/Esr1 recombinase driven by the chicken B-actin promoter coupled to the 
cytomegalovirus immediate-early enhancer (Jackson Labs; stock number 004682). 
All mice were on the C57BL/6 background. Mice were dosed with tamoxifen once 
a day for 4 days to induce recombination (20 mg ml’ in corn oil, administered as 
0.15 mg tamoxifen per gram of mouse by oral gavage). Esr1 control of Cre recom- 
binase seemed to be leaky, in that 18-week-old mice that had not been given 
tamoxifen had the same actin-distribution phenotype as mice given tamoxifen; 
both were different from mice lacking CAG-Cre/Esr1. The phenotype was seen in 
a few cells of 3-week-old mice even in the absence of tamoxifen, so we restricted 
quantitative analysis to mice dosed with tamoxifen at 3 weeks. 

Monoclonal mouse anti-y-actin antibody clone 1-37 IgG (ref. 15) was purified 
from ascites using a T-gel Purification kit (Pierce) and conjugated to either Alexa- 
488 or Alexa-568 fluorescent dyes using a Monoclonal Antibody Labelling Kit 
(Invitrogen) following the manufacturer’s instructions. FITC-labelled anti-f-actin 
antibodies (clone AC-15) were obtained from Abcam (validated in ref. 14). 

For labelling, mice were perfused with 4% paraformaldehyde in PBS. Cochleas 
were then dissected and incubated in the same fixative for 2 h at room temperature 
(22 °C), washed in PBS and decalcified in 170 mM EDTA in PBS at 4°C for 16h. 
The organ of Corti was dissected, postfixed in 100% methanol at —20°C for 
10 min, rinsed in PBS and permeabilized in 0.5% Triton X-100 in PBS for 
20 min at room temperature. Tissue was blocked for 1h in 5% goat serum in 
PBS before incubation with the indicated antibodies. Samples were mounted in 
ProLong anti-fade reagent and viewed on a Deltavision PersonalDV deconvolu- 
tion microscope equipped with a 100 1.4 NA objective (Applied Precision). 
Stacks of images were collected at 0.20-\1m intervals and deconvolved using 
Resolve3d software (Applied Precision). 


17. Prosser, H. M., Rzadzinska, A. K., Steel, K. P. & Bradley, A. Mosaic complementation 
demonstrates a regulatory role for myosin Vlla in actin dynamics of stereocilia. Mol. 
Cell. Biol. 28, 1702-1712 (2008). 
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Neuron-type-specific signals for reward and 
punishment in the ventral tegmental area 


Jeremiah Y. Cohen'*, Sebastian Haesler'*, Linh Vong’, Bradford B. Lowell? & Naoshige Uchida! 


Dopamine has a central role in motivation and reward. Dopaminergic 
neurons in the ventral tegmental area (VTA) signal the discrepancy 
between expected and actual rewards (that is, reward prediction 
error)'*, but how they compute such signals is unknown. We 
recorded the activity of VTA neurons while mice associated differ- 
ent odour cues with appetitive and aversive outcomes. We found 
three types of neuron based on responses to odours and outcomes: 
approximately half of the neurons (type I, 52%) showed phasic 
excitation after reward-predicting odours and rewards in a manner 
consistent with reward prediction error coding; the other half of 
neurons showed persistent activity during the delay between odour 
and outcome that was modulated positively (type II, 31%) or nega- 
tively (type III, 18%) by the value of outcomes. Whereas the activity 
of type I neurons was sensitive to actual outcomes (that is, when the 
reward was delivered as expected compared to when it was un- 
expectedly omitted), the activity of type II and type III neurons 
was determined predominantly by reward-predicting odours. We 
‘tagged’ dopaminergic and GABAergic neurons with the light- 
sensitive protein channelrhodopsin-2 and identified them based 
on their responses to optical stimulation while recording. All iden- 
tified dopaminergic neurons were of type I and all GABAergic 
neurons were of type II. These results show that VTA GABAergic 
neurons signal expected reward, a key variable for dopaminergic 
neurons to calculate reward prediction error. 

Dopaminergic neurons fire phasically (100-500 ms) after unpre- 
dicted rewards or cues that predict reward’. Their response to reward 
is reduced when a reward is fully predicted. Furthermore, their activity 
is suppressed when a predicted reward is omitted. From these observa- 
tions, previous studies hypothesized that dopaminergic neurons signal 
discrepancies between expected and actual rewards (that is, they com- 
pute reward prediction error (RPE)), but how dopaminergic neurons 
compute RPE is unknown. 

Dopaminergic neurons make up about 55-65% of VTA neurons; the 
rest are mostly GABAergic inhibitory neurons* °, Many addictive drugs 
inhibit VTA GABAergic neurons, which increases dopamine release 
(called disinhibition), a potential mechanism for reinforcing the effects 
of these drugs’""*, Despite the known role of VTA GABAergic neurons 
inhibiting dopaminergic neurons in vitro’”, little is known about their 
role in normal reward processing. One obstacle has been the difficulty 
of identifying different neuron types with extracellular recording tech- 
niques. Conventionally, spike waveforms and other firing properties 
have been used to identify presumed dopaminergic and GABAergic 
neurons’”’*", but this approach has been questioned recently”'®. We 
thus aimed to observe how dopaminergic and GABAergic neurons 
process information about rewards and punishments. 

We classically conditioned mice with different odour cues that 
predicted appetitive or aversive outcomes. The possible outcomes 
were big reward, small reward, nothing, or punishment (a puff of air 
delivered to the animal’s face). Each behavioural trial began with a 
conditioned stimulus (CS; an odour, 1 s), followed by a 1-s delay and 


an unconditioned stimulus (US; the outcome). Within the first two 
behavioural sessions, mice began licking towards the water-delivery 
tube in the delay before rewards arrived, indicating that they quickly 
learned the CS-US associations (Fig. 1). The lick rate was significantly 
higher preceding big rewards than small ones (paired t-tests between 
lick rates for big versus small rewards for each session, P< 0.05 for 
each mouse). 

We recorded the activity of VTA neurons while mice performed the 
conditioning task. All 95 neurons showed task-related responses (ana- 
lysis of variance (ANOVA), all P< 0.001), thus all recorded neurons 
were used in the following analyses. Observing the temporal profiles of 
responses in trials with rewards, we found neurons that showed firing 
patterns that resemble those of dopaminergic neurons found in non- 
human primates’*">. These neurons were excited phasically by reward- 
predicting stimuli or reward (Fig. 2a, top). We also found many neurons 
with firing patterns distinct from typical dopaminergic neurons. These 
neurons showed persistent excitation during the delay before rewards, 
in response to reward-predicting odours (Fig. 2a, middle). Other 
neurons showed persistent inhibition to reward-predicting odours 
(Fig. 2a, bottom). To characterize the responses of the population, 
we measured the temporal response profile of each neuron during 
big-reward trials by quantifying firing rate changes from baseline in 
100-ms bins using a receiver operating characteristic (ROC) analysis 
(Fig. 2b and Supplementary Fig. 1). We calculated the area under the 
ROC curve (auROC) at each time bin. Values greater than 0.5 indicate 
increases in firing rate relative to baseline, whereas values less than 0.5 
indicate decreases. 

To classify these response profiles, we used principal component 
analysis (PCA) followed by unsupervised, hierarchical clustering. This 
yielded three clusters of neurons that were separated according to (1) 
the magnitude of activity during the delay between CS and US, and (2) 
the magnitude of responses to the CS or US (Fig. 2c). Forty-nine 
neurons (52%) were classified as type I, which showed phasic res- 
ponses. Twenty-nine neurons (31%) were classified as type I, which 
showed sustained excitation to reward-predicting odours, whereas 17 
neurons (18%) were classified as type III, which showed sustained 
inhibition (Fig. 2d). 
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Figure 1 | Odour-outcome association task in mice. a, Licking behaviour 
from a representative experimental session. Black bars indicate CS and US 
delivery. Shaded regions around lick traces denote standard error of the mean 
(s.e.m.). b, Mean ~ s.e.m. licks during the delay between CS and US as a 
function of days of the experiment across animals. 
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Figure 2 | VTA neurons show three distinct response types. a, Responses of 
example neurons. b, Responses of all neurons. Yellow, increase from baseline; 
cyan, decrease from baseline. Each row represents one neuron. The similarity 
order of the three main clusters is arranged to match the order presented in 

a. c, Top: the first three principal components of the auROC curves. Points are 
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coloured based on hierarchical clustering from the dendrogram. Bottom: 
classification of neurons based on response differences between big-reward and 
no-reward trials during the delay versus during the CS or US. d, Average firing 
rates from type I-III neurons. 


To identify dopaminergic neurons, we expressed channelrhodopsin-2 
(ChR2), a light-gated cation channel’””’, in dopaminergic neurons (see 
Methods). We confined expression to dopaminergic neurons by inject- 
ing adeno-associated virus containing FLEX-ChR2 (AAV-FLEX- 
ChR2)”’ into transgenic mice expressing Cre recombinase under the 
control of the promoter of the dopamine transporter (DAT; also called 
Slc6a3) gene (Supplementary Figs 2 and 3). For each neuron, we mea- 
sured the response to light pulses and the shape of spontaneous spikes. 
We observed many neurons that fired after light pulses (Fig. 3a, b). We 
calculated the correlation between the spontaneous spike waveform 
and light-evoked voltage response and plotted it against the energy of 
light-evoked responses for each recording (Fig. 3c). This yielded two 
distinct clusters: one that showed significant responses to light pulses 
and one that did not. To identify dopaminergic neurons stringently, we 
applied the criterion that the light-evoked waveform must look almost 
identical to the spontaneous waveform (correlation coefficient >0.9). 
Twenty-six neurons met this criterion (filled blue points in Fig. 3c). 
Consistent with direct light activation rather than indirect, synaptic 
activation, all 26 neurons showed light-evoked spikes within a few 
milliseconds of light onset with small jitter, and followed high- 
frequency light stimulation of 50 Hz (Supplementary Fig. 4). These 
properties strongly indicate that these 26 neurons expressed ChR2. 
We therefore designate these 26 neurons as identified dopaminergic 
neurons. All identified dopaminergic neurons were of type I. 
Conversely, none of type II or III neurons was activated by light (red 
and grey points in Fig. 3c). 

Next, we asked whether GABAergic neurons could be mapped to 
type II or III neurons. We recorded from 92 VTA neurons in mice 
expressing Cre recombinase under the control of the endogenous 
vesicular y-aminobutyric acid (GABA) transporter (Vgat; also called 
Slc32a1) gene. These mice showed similar licking behaviour to DAT- 
Cre mice (Supplementary Fig. 5). We applied the PCA parameters 


Figure 3 | Identifying dopaminergic and GABAergic neurons. a, Voltage 
trace from 10 pulses of 20 Hz light stimulation (cyan bars). Two light-triggered 
spikes are shown below. b, Response from this neuron to 20 Hz (left) and 50 Hz 
(right) stimulation. Ticks represent spikes. c, Quantification of light-evoked 
responses and mapping of response types in DAT-Cre mice. Blue, type I; red, 
type I]; grey, type III neurons. Identified dopaminergic neurons are indicated by 
filled circles. Abscissa: energy (integral of the squared voltage values, |v*d¢) of 
the light-evoked response from each neuron. Ordinate: cross-correlation 
between the mean spontaneous spike and the light-evoked response. Example 
neurons are shown to the right (black, spontaneous spikes; cyan, light-evoked 
voltages). d, Light-evoked responses in Vgat-Cre mice. Conventions are the 
same as inc. 
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from the 95 neurons from DAT-Cre mice to the 92 neurons from Vgat- 
Cre mice. This yielded 38 type I neurons, 34 type II neurons and 20 
type III neurons. Using the same criteria for GABAergic neurons as we 
used for dopaminergic neurons, we identified 17 GABAergic neurons 
(Fig. 3d and Supplementary Fig. 4). All 34 type II neurons fell in the 
upper cluster in Fig. 3d. We also found type I neurons that were 
inhibited by optical stimulation, consistent with local GABAergic 
stimulation (Supplementary Fig. 6). 

Our data set of identified dopaminergic neurons allows us to char- 
acterize their diversity. We observed that some were excited by reward, 
some were excited by a reward-predicting CS, and some were excited 
by both (Fig. 4a—c). Although previous studies in non-human primates 
found similar variability*°*' (Supplementary Fig. 7), this result may 
suggest that some dopaminergic neurons do not strictly follow canonical 
RPE coding. However, the US responses may be due to the delay 
between CS and US, known to increase the US response due to temporal 
uncertainty”®. In addition, this diversity was correlated with the effect 
of training that occurred over several days across the population of 
dopaminergic neurons, even after animals had reached asymptotic 
behavioural performance (Fig. 1b). Soon after reaching a behavioural 
performance criterion, many dopaminergic neurons showed stronger 
responses to US over CS but the preference gradually shifted to CS over 
several days (Fig. 4d; Pearson correlation, r= 0.42, P< 0.05). This is 
consistent with a previous study in non-human primates that showed 
US responses gradually disappear over >1 month of training”. Thus, 
identified dopaminergic neurons appear to respond to CS and US 
similarly to those reported in non-human primate studies. 

Another important response property that supports RPE coding in 
dopaminergic neurons is their decrease in firing rate when an expected 
reward is omitted’*. We thus omitted reward unexpectedly on 10% of 
big-reward trials in some sessions. Fifteen of seventeen dopaminergic 
neurons showed a decrease in firing rate upon reward omission rela- 
tive to reward delivery (Fig. 4f, g). The two dopaminergic neurons that 
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were not modulated by reward omission were excited by big-reward 
CS, but fired close to 0 spikes s' otherwise; the low firing rate at the 
time of reward left little room to ‘dip’ further. We obtained similar 
results when we compared the firing rate upon reward omission to the 
baseline firing rate (9 of 17 neurons P<0.05, t-test; mean 
auROC = 0.407, tig = 2.56, P<0.05; Supplementary Fig. 8a, b). 
Thus, most dopaminergic neurons coded RPE when expected reward 
was omitted. 

GABAergic neurons showed persistent activity during the delay 
period, which parametrically encoded the value of upcoming out- 
comes (paired t-tests between no-, small- and big-reward trials, all 
P<0.001 for 16 of 17 identified GABAergic neurons, Supplemen- 
tary Fig. 7a; regression slopes, Supplementary Fig. 10i). This suggests 
that these neurons encode expectation about rewards. If this is the case, 
one prediction is that the activity of these neurons is not modulated by 
delivery or omission of reward. Indeed, GABAergic (and unidentified 
type II) and type III neurons were not significantly modulated by the 
presence or absence of reward itself (Fig. 4f, g and Supplementary Fig. 8), 
in contrast to identified dopaminergic neurons. None of the identified 
GABAergic neurons, and only 2 of 17 unidentified type II neurons, 
showed significant decreases in firing rate relative to when reward was 
delivered. None of the 11 type III neurons showed significant modu- 
lation by reward omission. Thus, the activity of type II and III neurons 
was modulated predominantly by reward-predicting cues but not actual 
reward. 

Recent studies have revealed a diversity of responses of dopaminergic 
neurons to aversive stimuli: some are excited, others inhibited'*. To 
test whether this diversity exists in dopaminergic and GABAergic 
VTA neurons, we delivered air puffs in some sessions. Identified 
dopaminergic neurons showed some diversity: although most signifi- 
cant responses were inhibitory, some were excitatory (Fig. 4h, i and 
Supplementary Fig. 9). In contrast, most type II and III neurons (and 
13 of 14 identified GABAergic neurons) were excited by air puffs. 


a —_— Dopaminergic b CS spikes s“ US spikes s™ c d 
"cs "us 18 6 ° . 
7,20 —é { 2 age). ents ot 
g 13 £3 TIE s3° 
£10 } F 2) "2 o.e 88 
8 3 Sis 
8 -0. 
0 T t T ji T T T T iT 0 i T 7 s kd . T 
01 2 8 0.00 3.75 0.00 1 0 1 2 7 12 
Time —- odour (s) Reward size (ul) osus Day 
a h "cs*us i 
Dopaminergic — Big reward g 
'_ Small reward 40 Z a m= 64 
4 1— Nothing 7 4 + 
15 | — Punishment P<0.05 ip 
1 1 
= = 204 34 Ss 3+ 
%, } 0 ! 0 
2 8 g 
@ < o/ ot—— £ o4 04 | 
x a g oe ot OS —oT oo 
& a 9 QD 5 
@ GABAergic 2 a £ 5 
© 804 © 1004 zZ 15 s 2 gl 
= | iz iz 
is 50+ ra 60-4 44 ; 
1 
; u 1 
04 T T t T 04 T T T T 0+; T O4 T T T T o+ T 
0 1 2 3 0 1 2 3 0.0 0.5 1.0 0 1 2 3 0.0 0.5 1.0 
Time — odour (s) Time — odour (s) auROC Time — odour (s) auROC 


(rewarded vs omitted) 


Figure 4 | Response variability based on CS-US preference, reward 
omission and air puffs. a, Response of a dopaminergic neuron during big- 
reward trials. b, Firing rate (mean + s.e.m.) versus reward size (left) and in 
response to big-reward-predicting CS and big-reward US for each 
dopaminergic neuron (right). c, Histogram of CS—-US index for dopaminergic 
neurons. d, CS-US index versus day after the behaviour was learned. e, Average 
responses of dopaminergic and GABAergic neurons. f, Responses of a 
dopaminergic and GABAergic neuron for reward present (solid) and 


(air puff vs baseline) 


unexpectedly absent (dashed) on big-reward trials. g, Histograms of differences 
in firing rates during the outcome period (2-2.5 s) between rewarded and 
reward-omitted trials for dopaminergic (top) and GABAergic (bottom) 
neurons. Values are represented using auROC (<0.5, rewarded < omitted; 0.5, 
no difference; >0.5, rewarded > omitted). Significant values are filled (t-test, 
P<0.05). h, Responses of a dopaminergic and GABAergic neuron during 
punishment trials. i, Histograms of auROC values during the air puff (2-2.5 s) 
relative to baseline (<0.5, decrease; >0.5, increase from baseline). 
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Detecting the discrepancy between expected and actual outcomes 
has a critical role in optimal learning'”””’. Although phasic firing of 
VTA dopaminergic neurons may act as such an error signal, how this is 
computed remains largely unknown. Models have postulated the exist- 
ence of value-dependent, inhibitory input to dopaminergic neurons 
that persists during the delay between a CS and US (Supplementary 
Fig. 1la)'”’. Our data indicate that VTA GABAergic neurons provide 
such an inhibitory input that counteracts excitatory drive from primary 
reward when the reward is expected. In addition, these neurons were 
excited by aversive stimuli, potentially contributing to suppression of 
firing in some dopaminergic neurons in response to aversive events 
(Fig. 4). Previous work showed that VTA GABAergic neurons receive 
inputs from prefrontal cortex and subcortical areas that could provide 
reward-related signals***’. Phasic excitation of VTA GABAergic 
neurons could be driven by inputs from lateral habenula neurons that 
are phasically excited by aversive stimuli”. These habenular neurons do 
not show sustained activity between CS and US, so it is unlikely that 
they provide reward expectation signals to VTA GABAergic neurons. 
Instead, these signals may come from the pedunculopontine nucleus” 
or orbitofrontal cortex” (Supplementary Fig. 11b). VTA GABAergic 
neurons form synapses preferentially onto dendrites of dopaminergic 
neurons’, whereas other inhibitory inputs form synapses onto their 
somata”. Dendritic inhibition is thought to be weaker than somatic 
‘shunting’ inhibition*® but appears well suited for deriving graded 
outputs by ‘arithmetically’ combining excitatory and inhibitory inputs. 

A major effect of drugs of addiction is inhibition of VTA GABAergic 
neurons’*. If VTA GABAergic neurons are involved in computation of 
RPE, inhibition of GABAergic neurons by addictive drugs could lead 
to sustained RPE even after the learned effects of drug intake are well 
established, thereby resulting in sustained reinforcement of drug taking”. 
Understanding local circuits in VTA in the context of learning theory 
may thus provide crucial insights into normal as well as abnormal 
functions of reward circuits. 


METHODS SUMMARY 


All surgical and experimental procedures were in accordance with the National 
Institutes of Health Guide for the Care and Use of Laboratory Animals and 
approved by the Harvard Institutional Animal Care and Use Committee. We 
injected male DAT-Cre and Vgat-Cre mice with adeno-associated virus carrying 
FLEX-ChR2 into the VTA and implanted a head plate and a microdrive contain- 
ing six tetrodes and an optical fibre. While mice performed a classical conditioning 
task, we recorded spiking activity from VTA neurons. We delivered pulses of light 
to activate ChR2 and classified neurons as dopaminergic, GABAergic or uniden- 
tified. After experiments, we performed immunohistochemistry to localize record- 
ing sites amid dopaminergic neurons. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Animals. We used six adult male mice, backcrossed with C57/BL6J mice, 
heterozygous for Cre recombinase under the control of the DAT gene (B6.SJL- 
Sle6a3*™roBKmn/T The Jackson Laboratory)*' and six adult male mice, 
backcrossed with C57/BL6J mice, heterozygous for Cre recombinase under the 
control of the Vgat gene (Vgat-ires-Cre)*?. Animals were housed on a 12-h dark/ 
12-h light cycle (dark from 06:00 to 18:00) and each performed the conditioning 
task at the same time of day, between 07:00 and 18:00. 

Surgery and viral injections. Mice were surgically implanted with a head plate 
and a microdrive containing electrodes and an optical fibre. During surgery, we 
injected 200-500 nl adeno-associated virus (AAV), serotype 5, carrying an 
inverted ChR2 (H134R)-EYFP flanked by double loxP sites'*'*°? into the VTA 
stereotactically (from bregma: 3.1 mm posterior, 0.8 mm lateral, 4-4.5 mm ventral). 
Our expression was highly selective (<1% of ChR2-positive neurons were negative 
for the dopaminergic neuron marker tyrosine hydroxylase, TH) and efficient (>90% 
of TH-expressing neurons co-expressed ChR2; Supplementary Figs 2 and 3). 

For each unidentified type I neuron in DAT-Cre mice, we were unable to 
identify it as dopaminergic using our stringent criteria not because there was no 
response on the electrode after light pulses, but because the response did not match 
the shape of the spontaneous spike waveforms (open blue points in Fig. 3c). This 
may have been from changing the spike shape by opening many ChR2 or due to 
‘spillover’ from nearby neurons, contaminating the signal. 

After injections in Vgat-Cre mice, we confirmed that our ChR2 expression was 
selective (<1% of ChR2-positive neurons were negative for the GABAergic neuron 
marker glutamic acid decarboxylase, GAD65/67) and efficient (but less efficient 
than in DAT-Cre mice; >60% of GAD65/67-expressing neurons co-expressed 
ChR2) (Supplementary Fig. 2). 

For fluorogold tracer injections (Fluorochrome), we injected two animals each 
with 10-25 nl into the following areas: prefrontal cortex (from bregma: 2.1 mm 
anterior, 0.25 mm lateral, 1.6mm ventral), medial striatum (1.4mm anterior, 
1.0mm lateral, 4.0mm ventral) and basolateral amygdala (1.4mm posterior, 
2.8 mm lateral, 4.3 mm ventral). 

Allsurgery was performed under aseptic conditions with animals under ketamine/ 
medetomidine anaesthesia (60/0.5 mg kg’, intraperitoneal, respectively). Analgesia 
(ketoprofen, 5 mgkg”', intraperitoneal) was administered postoperatively. 
Behavioural task. After >1 week of recovery, mice were water-deprived in their 
home cage. Weight was maintained above 90% of their full body weight. Animals 
were head-restrained using a custom-made metal plate and habituated for 1-2 d 
while head-restrained before training on the task. Odours were delivered with a 
custom-made olfactometer**. Each odour was dissolved in paraffin oil at 1/100 
dilution. Thirty microlitres of diluted odour was placed inside a filter-paper hous- 
ing. Odours were isoamyl acetate, 1-butanol, N-butyl acetate, citral, eugenol, (+) 
limonene and (—) carvone, and differed for different animals. Odourized air was 
further diluted with filtered air by 1:10 to produce a 500 ml min‘ ' total flow rate. 
Licks were detected by breaks of an infrared beam placed in front of the water tube. 

We delivered one of four odours, selected pseudorandomly, for 1 s, followed by 

a delay of 1s and an outcome. Each odour predicted a different outcome: a large 
drop of water (3.75 il; valve open for 100 ms), a small drop of water (0.75 ul; valve 
open for 20 ms), no outcome, or an air puff delivered to the animal’s face. Air puff 
trials were included for 17 identified dopaminergic neurons, 14 identified 
GABAergic neurons, and 15 type III neurons. Intertrial intervals (ITIs) were 
drawn from an exponential distribution with a rate parameter of 10 (that is, 
ITI = 10exp(—10x)). This resulted in a flat ITI hazard function, ensuring that 
expectation about the start of the next trial did not increase over time. Data from 
DAT-Cre mice were obtained from 85 sessions (9-23 sessions per animal, 14 + 4.8 
sessions, mean + s.d.). For 17 identified dopaminergic neurons, 14 identified 
GABAergic neurons, and 11 type III neurons, we omitted rewards during 10% 
of big-reward trials. Animals performed between 400 and 1,000 trials per day 
(533 + 120 trials, mean + s.d.). Data from Vgat-Cre mice were obtained from 
71 sessions. 
Electrophysiology. We recorded extracellularly from multiple neurons simulta- 
neously using a custom-built 200-1m-fibreoptic-coupled screw-driven microdrive 
with six implanted tetrodes (four wires wound together). For three Vgat-Cre mice, 
we used a 105-\1m fibre-optic. Tetrodes were glued to the fibre optic with epoxy. 
The ends of the tetrodes were 350-600 jum from the end of the fibre optic. Neural 
signals and time stamps for behaviour were recorded using a DigiLynx recording 
system (Neuralynx). Broadband signals from each wire filtered between 0.1 and 
9,000 Hz were recorded continuously at 32 kHz. To extract the timing of spikes, 
signals were band-pass-filtered between 300-6,000 Hz and sorted online and off- 
line using custom software. 

To verify that our recordings targeted dopaminergic or GABAergic neurons, we 
used ChR2 to observe stimulation-locked spikes*’. The optical fibre was coupled 
with a diode-pumped solid-state laser with analogue amplitude modulation 
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(Laserglow Technologies). Before and after each behavioural session, we delivered 
trains of 10 light pulses, each 5-ms long, at 1, 5, 10, 20 and 50 Hz at 473 nm at 
5-20 mW mm”. We used frequencies of 1, 10, 20, 50 and 100 Hz in Vgat-Cre 
mice. Higher intensities typically resulted in light-evoked spike waveforms that did 
not match spontaneous ones (open blue points in Fig. 3c and open red points in 
Fig. 3d). Therefore, we adjusted the light intensity after observing the responses. 
The increasing latency of light-evoked spiking as a function of stimulation fre- 
quency indicates that the response was not due to photochemical artefact 
(Supplementary Fig. 4f, j). Spike shape was measured using a broadband signal 
(0.1- 9,000 Hz) sampled at 32 kHz. This ensured that particular features of the 
spike waveform were not missed. A previous study found decreasing probability of 
activation with increasing frequency**. This discrepancy may be explained by the 
low levels of light we used to identify neurons and the difference in experimental 
preparation, in vitro versus in vivo’*. 

We used two criteria to include a neuron in our data set. First, the neuron must 
have been well isolated (signal-to-noise ratio of >5 dB). Second, the neuron must 
have been recorded within 500 tm ofan identified dopaminergic neuron (or a type 
I neuron in Vgat-Cre mice), to ensure that all neurons came from VTA. Using 
distance cutoffs of 400, 300 or 200 ttm yielded similar estimates of the proportion 
of type I neurons as the full data set (49 of 91, 49 of 86 and 49 of 82, respectively, in 
DAT-Cre mice). 

Recording sites were further verified histologically with electrolytic lesions using 

15-20s of 100 1A direct current and from the optical fibre track (Supplementary 
Fig. 12). 
Data analysis. To measure firing rates, peristimulus time histograms (PSTHs) 
were constructed using 10-ms bins. To calculate spike density functions, PSTHs 
were convolved with a function resembling a postsynaptic potential*’. To deter- 
mine whether a neuron showed a significant task-related response, we calculated 
an ANOVA on thetrial-by-trial firing rates (spikes s” ') during the baseline period 
(1s before odour onset), CS period (from odour onset to odour offset), delay (from 
odour offset to outcome onset), and US period (from outcome onset to 500 ms 
after outcome onset). The factors were task epoch (that is, baseline, CS, delay, or 
US) and outcome type. All neurons showed a main effect of time; 183 of 187 
neurons showed a main effect of outcome type; 184 of 187 neurons showed a 
time-outcome interaction (P< 0.001). 

Light-evoked spikes were detected during the 10 ms after light onset. If <20% of 
light pulses had a spike (defined as a waveform that matched that of the isolated 
unit) during the 10ms after light onset (lower left points in Fig. 3c, d), the 
maximum voltage during that interval was used as the light-evoked ‘response’. 
Cross-correlations between spontaneous and light-evoked spike waveforms were 
calculated by aligning the positive peak of each waveform, averaging separately, and 
aligning the peaks of the averages. The cross-correlation coefficients reported in 
Fig. 3 are from a lag of 0 ms. The correlation was calculated using the full duration of 
the spontaneous spike (spike duration was measured as the first time until the last 
time at which the voltage was significantly different from baseline using Wilcoxon 
rank sum tests). The energy of the light-evoked waveform is defined as the integral 
of the squared voltage values (|v"dt). To ensure that measurements of the fidelity of 
light-evoked responses were not biased by poorly isolated neurons, we plotted the 
probability of firing at 50 Hz light stimulation against L-ratio, a measure of isolation 
quality** (Supplementary Fig. 13). L-ratio approximates the fraction of ‘contami- 
nated’ spikes; smaller L-ratios indicate better isolation. One hundred and eighty of 
187 neurons in the data set had L-ratios <0.05. 

CS-US indices were calculated as (CS — US)/(CS + US), where CS is the dif- 
ference between the peak firing rate (maximum value of the PSTH) in the 1 s after 
odour onset and the baseline firing rate, and US is the difference between the peak 
firing rate in the 0.5s after reward onset and the baseline firing rate. The baseline 
firing rate was calculated as the mean of the PSTH in the 0.5 s before odour onset. 

ROC curves were calculated by comparing the distribution of firing rates across 
trials in 100-ms bins to the distribution of baseline firing rates (900 ms before 
odour onset; Supplementary Fig. 1). PCA was calculated using the singular value 
decomposition of the auROC. Learning was measured using the lick responses 
(mean licks s_') across consecutive days of behavioural sessions. Curves were fit 
with logistic functions, k/(1+exp(—1f)). The task was considered fully learned if the 
learning rate was within 95% of the carrying capacity of the logistic (k). All 12 
animals learned the task within the first 3 days. 

Hierarchical clustering was done using the first three PCs of the auROC curves 
using a Euclidean distance metric and complete agglomeration method. 

All statistical tests were done with corrections for multiple comparisons 
(Bonferroni or Tukey). Analyses were done with R (http://www.r-project.org/). 
Immunohistochemistry. After recording, which lasted between 9 and 23 days, 
mice were given an overdose of ketamine/medetomidine, exsanguinated with 
saline, perfused with paraformaldehyde, and brains were cut in 50m coronal 
or horizontal sections. Sections were immunostained with antibodies to TH and 
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secondary antibodies labelled with Cy3 (Jackson Immunoresearch). Sections were 
further stained with 4',6-diamidino-2-phenylindole (DAPI) or TO-PRO-3 
(Invitrogen) to visualize nuclei. Recording sites were identified and verified to 
be amid EYFP staining and TH staining in VTA. Brain slices from additional 
virus-injected DAT-Cre and Vgat-Cre animals were stained for TH and GAD- 
65/67 (Millipore) with secondary antibodies labelled with Cy3 or Alexa 594 
(Invitrogen). Cellular fluorescence intensities of ChR2-EYFP were measured using 
Volocity Image Analysis Software (Perkin Elmer). 
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Exercise-induced BCL2-regulated autophagy is 
required for muscle glucose homeostasis 


Congcong He!*3*, Michael C. Bassik*+*, Viviana Moresi’, Kai Sun”°, Yongjie Wei”, Zhongju Zou’, Zhenyi An"?, Joy Loh’, 
Jill Fisher*, Qihua Sun'’, Stanley Korsmeyer*}, Milton Packer’, Herman I. May’, Joseph A. Hill’, Herbert W. Virgin’, 
Christopher Gilpin’, Guanghua Xiao®, Rhonda Bassel-Duby”, Philipp E. Scherer”® & Beth Levine’?3° 


Exercise has beneficial effects on human health, including protec- 
tion against metabolic disorders such as diabetes’. However, the 
cellular mechanisms underlying these effects are incompletely 
understood. The lysosomal degradation pathway, autophagy, is an 
intracellular recycling system that functions during basal conditions 
in organelle and protein quality control’. During stress, increased 
levels of autophagy permit cells to adapt to changing nutritional and 
energy demands through protein catabolism’. Moreover, in animal 
models, autophagy protects against diseases such as cancer, neuro- 
degenerative disorders, infections, inflammatory diseases, ageing 
and insulin resistance**. Here we show that acute exercise induces 
autophagy in skeletal and cardiac muscle of fed mice. To investigate 
the role of exercise-mediated autophagy in vivo, we generated 
mutant mice that show normal levels of basal autophagy but are 
deficient in stimulus (exercise- or starvation)-induced autophagy. 
These mice (termed BCL2 AAA mice) contain knock-in mutations 
in BCL2 phosphorylation sites (Thr69Ala, Ser70Ala and Ser84Ala) 
that prevent stimulus-induced disruption of the BCL2-beclin-1 
complex and autophagy activation. BCL2 AAA mice show decreased 
endurance and altered glucose metabolism during acute exercise, 
as well as impaired chronic exercise-mediated protection against 
high-fat-diet-induced glucose intolerance. Thus, exercise induces 
autophagy, BCL2 is a crucial regulator of exercise- (and starvation)- 
induced autophagy in vivo, and autophagy induction may con- 
tribute to the beneficial metabolic effects of exercise. 

To investigate whether autophagy is induced by exercise, we ana- 
lysed tissues of mice that transgenically express a green fluorescent 
protein (GFP)-labelled marker of autophagosomes, GFP-LC3 (LC3 is 
also known as MAP1LC3) (ref. 7), after treadmill exercise. In both 
skeletal and cardiac muscle, autophagosome (GFP-LC3 puncta) num- 
bers increased after 30 min (~300 m) of running and reached a plateau 
at 80 min (~900 m) (Fig. la—c). This was observed in several muscle 
groups, including the vastus lateralis (thigh muscle) (Fig. la—c), soleus, 
tibialis anterior and extensor digitorum longus (Supplementary Fig. 1). 
Exercise also resulted in biochemical evidence of skeletal and cardiac 
muscle autophagy, including conversion of the non-lipidated form of 
LC3, LC3-I, to the autophagosome-membrane-associated lipidated 
form, LC3-II, and degradation of the autophagy substrate protein 
p62 (Fig. 1d). In addition, exercise induced autophagy in other organs 
involved in glucose and energy homeostasis, such as liver and pancreas, 
including islet B-cells (Supplementary Fig. 2a-f) and (based on LC3-II 
conversion) adipose tissue (Supplementary Fig. 2g). Thus, exercise is a 
newly defined stimulus that induces autophagy in vivo. 


We found that exercise-induced autophagy in vivo involves disrup- 
tion of the BCL2-beclin-1 complex. BCL2 is an anti-apoptotic and 
anti-autophagy protein that inhibits autophagy through a direct inter- 
action with the BH3 domain of the autophagy protein beclin 1 at the 
endoplasmic reticulum*. Disruption of the BCL2-beclin-1 complex is 
crucial for stimulus-induced autophagy in mammalian cells*. Beclin 1 
immunoprecipitation with BCL2 decreased in muscle within 15 min of 
exercise and was barely detectable after 30 min (Fig. le). Dissociation 
of the BCL2-beclin-1 complex was not associated with phosphoryla- 
tion of kinases previously linked to BCL2 phosphorylation, such as 
JNK and p38 MAPK? (Supplementary Fig. 3). 

To study the physiological functions of exercise-induced autophagy 
in vivo, we constructed mutant mice that are deficient in stimulus- 
induced but not basal autophagy. As phosphorylation of three sites in 
the non-structured loop of human BCL2 is critical for stimulus- 
induced autophagy in vitro’, we generated knock-in (Bcl2“*“) mice 
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Figure 1 | Exercise induces autophagy in skeletal and cardiac muscle. 

a, b, Representative images of GFP-LC3 puncta (autophagosomes) in skeletal 
(vastus lateralis) (a) and cardiac (b) muscle from GFP-LC3 transgenic mice at 
serial time points after exercise. Scale bar, 20 um. c, Quantification of data 
(mean + s.e.m. of 10 tissue sections) ina and b. **P < 0.01, ***P < 0.001 (one- 
way ANOVA). d, Western blot analysis of LC3-I/II (non-lipidated and 
lipidated forms of MAP1LC3, respectively) and p62 levels in indicated tissue 
from mice at rest (—) or after maximal exercise (+). Skeletal and cardiac 
indicate skeletal and cardiac muscle, respectively. e, Co-immunoprecipitation 
of beclin 1 with BCL2 in muscle tissue from mice at indicated time points after 
exercise. IP, immunoprecipitate; WB, western blot; WCL, whole cell lysates. 
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lacking three conserved phosphorylation residues in the non-structured 
loop region of mouse BCL2: Thr 69, Ser 70 and Ser 84 (homologous to 
human Ser 87) were replaced by alanines (BCL2 AAA) (Supplementary 
Fig. 4a—c). Homozygous Bcl2“*“ mice were viable and fertile, born in 
the expected Mendelian ratio, of normal size and weight, and displayed 
normal histology of major organs (data not shown). Steady-state levels 
of BCL2 AAA in Bcl2**4 mice were similar to those of wild-type (WT) 
BCL2 in Bcl2“* mice in skeletal and cardiac muscle, liver, adipose tissue 
and pancreas (Supplementary Fig. 4d). 

Murine embryonic fibroblasts (MEFs) derived from BCL2 AAA mice 
showed a defect in stimulus-induced autophagy (Fig. 2a, b). In response 
to starvation, BCL2 underwent phosphorylation, BCL2 dissociated from 
beclin 1, and autophagy was induced in MEFs from isogenic wild-type 
mice. However, in BCL2 AAA MEFs, BCL2 phosphorylation was absent 
(as measured by p32 labelling), the BCL2-beclin-1 complex was not 
disrupted, and less autophagy was induced. Levels of basal autophagy 
were similar in wild-type and BCL2 AAA MEFs. Similar findings were 
observed in vivo in skeletal and cardiac muscle (Supplementary Fig. 5); 
autophagosome numbers were similar at baseline in wild-type and BCL2 
AAA mice expressing GFP-LC3 but failed to increase in GFP-LC3 
BCL2 AAA mice in response to 48 h starvation. 

To evaluate whether BCL2 AAA mice are deficient in exercise- 
induced autophagy, we exercised GFP-LC3 wild-type mice and 
GFP-LC3 BCL2 AAA mice for a fixed time and fixed distance 
(80 min (~900m)) and at 75% of their maximal running capacity 
(see Fig. 3a). Under both conditions, BCL2 AAA mice displayed 
marked impairment of exercise-induced upregulation of skeletal and 
cardiac muscle (as measured by numbers of GFP-LC3 puncta) (Fig. 2c, 
d and Supplementary Fig. 1b). BCL2 AAA mice also exhibited less 
exercise-induced LC3-II conversion and p62 degradation in skeletal 
and cardiac muscle (Fig. 2e), impaired autophagic responses in liver 
and pancreatic B-cells (Supplementary Fig. 2c, f), and defective exercise- 
induced dissociation of the BCL2-beclin-1 complex in muscle (Fig. 2f). 
Thus, non-phosphorylatable BCL2 does not alter basal autophagy in 
vivo, but prevents autophagy activation in response to starvation and 
exercise. This blockade of autophagy activation was not associated with 
increased cell death (Supplementary Fig. 6). 


The deficient exercise-induced autophagy in BCL2 AAA mice was 
accompanied by lower maximal exercise capacity (Fig. 3a). BCL2 AAA 
mice had similar levels of serum lactate after maximal exercise, similar 
baseline muscle strength, and similar baseline cardiac function and 
size as wild-type mice (Supplementary Fig. 7). Characterization of 
various muscle groups (soleus, tibialis anterior and extensor digitorum 
longus) from wild-type and BCL2 AAA mice at baseline and after 
80 min of exercise revealed no differences in weight, fibre cross-sectional 
area, morphology (haematoxylin and eosin staining), fibre type analysis, 
glycogen content, or mitochondrial content and functionality (Fig. 3b 
and Supplementary Figs 8-10). These analyses suggest that differences in 
baseline cardiac and skeletal muscle properties do not account for 
decreased exercise endurance in BCL2 AAA mice. 

During strenuous exercise, mammals undergo metabolic changes to 
increase skeletal muscle glucose uptake/utilization efficiency, includ- 
ing increased insulin sensitivity and redistribution of glucose trans- 
porters, such as GLUT4 (also known as SLC2A4), to the plasma 
membrane’™"', which is essential for exercise-stimulated glucose 
uptake’. However, BCL2 AAA mice demonstrated impaired exercise- 
induced increase in insulin sensitivity, as reflected by less of a decline in 
plasma glucose (Fig. 3c) and plasma insulin (Fig. 3d) levels than in 
wild-type mice. Also, unlike wild-type mice, BCL2 AAA mice failed to 
exhibit increased plasma membrane GLUT4 localization in the vastus 
lateralis and soleus muscles following maximal or 80 min exercise, 
respectively (Fig. 3e and Supplementary Figs 1la and 12a) and they 
exhibited lower levels of radiolabelled glucose uptake in soleus muscle 
(Fig. 3h). 

Adenosine monophosphate-activated protein kinase (AMPK) has a 
central role in enhanced GLUT4 plasma membrane localization and 
skeletal muscle glucose uptake during exercise’*. BCL2 AAA mice had 
a notable decrease in AMPK activation, as measured by levels of 
phosphorylation at residue threonine 172, in both the vastus lateralis 
(Fig. 3f and Supplementary Fig. 11b) and in the soleus (Supplementary 
Fig. 12b) muscles. Phosphorylation of the downstream AMPK target 
acetyl-CoA carboxylase (ACC) paralleled AMPK phosphorylation 
(Supplementary Fig. 13), providing additional support for a defect in 
exercise-induced AMPK activation in BCL2 AAA mice. 
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Figure 2 | Non-phosphorylatable BCL2 AAA knock-in mutations block 
BCL2 phosphorylation, BCL2-beclin 1 dissociation, and starvation- and 
exercise-induced autophagy. a, Analysis of BCL2 phosphorylation (detected 
by anti-BCL2 immunoprecipitation and autoradiography of *’P-labelled cells) 
and beclin 1 co-immunoprecipitation with BCL2 in wild-type (WT) or BCL2 
AAA MEFs grown in normal media or subjected to 4h Earle’s balanced salt 
solution (EBSS) starvation. p-BCL2, phospho-BCL2. b, Quantification of GFP- 
LC3 puncta (autophagosomes) in MEFs of indicated genotype in normal 
growth conditions or starvation conditions. Data represent mean + s.e.m. for 
100 cells per well of triplicate samples per condition. Similar results were 
observed in three independent experiments. c, Representative images of GFP- 
LC3 puncta (autophagosomes) in skeletal and cardiac muscle of GFP-LC3 
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wild-type and GFP-LC3 BCL2 AAA mice before exercise, after 80 min exercise, 
or after 75% of maximal exercise capacity. Scale bar, 20 um. d, Quantification of 
data (mean ~ s.d. of 4 mice per experimental group) in c. e, Western blot 
analysis of LC3-I/II and p62 levels in indicated tissue from mice of indicated 
genotype at rest (—) or after maximal exercise (+). Skeletal and cardiac indicate 
skeletal and cardiac muscle, respectively. f, Co-immunoprecipitation of beclin 1 
with BCL2 in muscle tissue from mice of indicated genotype at rest (—) or after 
30 min of exercise. WCL, whole cell lysates. NS, not significant. *P < 0.05, 
**P < 0.01, ***P < 0.001, one-way ANOVA for comparison between groups; 
+P <0.001, two-way ANOVA for comparison of magnitude of changes 
between different groups in mice of different genotypes. 
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Figure 3 | BCL2 AAA mice show deficient exercise endurance and 
alterations in muscle glucose metabolism. a, Maximal treadmill running 
distance for mice of indicated genotype. Data represent mean + s.e.m. of 5 mice 
per group. b, Representative haematoxylin and eosin (H & E) and periodic acid- 
Schiff (PAS) staining in tibialis anterior muscle sections from mice of indicated 
genotype. Scale bar, 20 1m. ¢, d, Plasma glucose (c) and insulin (d) levels in 
mice of indicated genotype at rest, after 80 min exercise (~900 m), or maximal 
exercise. Data represent combined mean = s.e.m. for 9-11 mice per group from 
three independent cohorts; similar results were observed in each cohort. 

e, Representative images of GLUT4 immunofluorescence staining in vastus 
lateralis muscle of mice of indicated genotype before and after maximal 
exercise. Scale bar, 20 ttm. For b and e, similar results were observed in 3 mice 
per group. f, Western blot analysis of AMPK phosphorylation (p-AMPK 
(Thr 172)) in vastus lateralis muscle lysates from mice of indicated genotype at 
indicated time after exercise. g, Maximal treadmill running distance for mice of 
indicated genotype. Data represent mean + s.e.m. of 4-6 male mice per group. 
h, Soleus muscle '*C-deoxyglucose uptake during treadmill exercise in mice of 
indicated genotype. Data represent mean + s.e.m. of 3 mice per group. 

*P < 0.05, **P < 0.01, ***P < 0.001, one-way ANOVA for comparison 
between groups; P < 0.05, *+P < 0.01, two-way ANOVA for comparison of 
magnitude of changes between different groups in mice of different genotypes. 
NS, not significant. 
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No differences were observed in phosphorylation of CaMK1, a 
downstream target of the Ca’*-stimulated kinase CaMKKf (which 
is upstream of AMPK in certain contexts)!’ (Supplementary Fig. 13b), 
suggesting that differences in levels of AMPK activation between wild- 
type and BCL2 AAA mice are not due to altered Ca” homeostasis. 
They are also probably unrelated to possible effects of BCL2 on 
mitochondrial function, because phosphorylated BCL2 resides at the 
endoplasmic reticulum” and markers of mitochondrial function did 
not differ in the muscles of BCL2 AAA and wild-type mice (Sup- 
plementary Fig. 10). Differences in insulin signalling molecules also 
did not explain the altered glucose homeostasis in exercised BCL2 
AAA mice, because at late time points during exercise, decreased 
(not increased) AKT activation and mTOR activation was observed 
in muscle of both wild-type and BCL2 AAA mice (Supplementary 
Fig. 13a). No differences were observed in muscle phosphorylation 
of the AKT substrate AS160 (also known as TBC1D4), or levels of 
the muscle-specific forkhead transcription factor target atrogin 1 (also 
known as FBXO32), in response to exercise or between genotypes. 

To confirm that the observed defects in exercise-induced endurance 
and muscle glucose metabolism are due to defective autophagy, we 
examined mice with allelic loss of the autophagy gene beclin 1 (Becn1), 
which have decreased beclin 1 protein expression and decreased 
starvation-induced autophagy in skeletal muscle’. Similar to BCL2 
AAA mice, Becn1*’~ mice exhibited deficient exercise-induced 
autophagy in skeletal muscle (Supplementary Fig. 14a, b), normal 
muscle strength (Supplementary Fig. 14c), decreased maximal 
treadmill running distance (Fig. 3g), impaired exercise-induced 
GLUT4 plasma membrane localization (Supplementary Fig. 14d) 
and muscle glucose uptake (Fig. 3h), and lower levels of exercise- 
induced muscle AMPK activation (phospho-AMPK and its target, 
phospho-ACC) (Supplementary Fig. 14e) than wild-type controls. 
The similarity of the phenotypes of BCL2 AAA and Becn1*'~ mice 
provides strong support for a role of deficient beclin 1 activity, rather 
than other BCL2-regulated functions, in the impairment of exercise 
endurance, glucose uptake and AMPK activation in BCL2 AAA mice. 

AMPK activates autophagy in response to low energy conditions by 
sensing the cellular ratio of AMP to ATP”. Deficient AMPK activation 
in BCL2 AAA and Becn1*’~ mice during exercise raised the possibility 
that, similar to observations in vitro’®, the autophagy pathway may 
function in vivo in a feed-forward manner in AMPK activation. To 
evaluate whether impaired exercise-induced AMPK activation in vivo 
is due to deficient autophagy, rather than potential autophagy- 
independent effects of the BCL2 AAA mutation or monoallelic loss 
of beclin 1, we examined exercise-induced AMPK activation in 
Atgl6l1"™ mice that are hypomorphic for ATGI6L1 (ref. 17), an 
essential autophagy protein that specifies the LC3 lipidation site for 
autophagosomal membrane biogenesis'®. Atg16l1"™ mice had a sig- 
nificant reduction in ATG16L1 protein expression in skeletal muscle 
and a defect in exercise-induced autophagy in muscle lysates after 
80 min of exercise (Supplementary Fig. 15a, b). This defect in exercise- 
induced autophagy was associated with decreased exercise-induced 
AMPK phosphorylation and ACC phosphorylation (Supplementary 
Fig. 15c). 

Together, our in vivo studies in BCL2 AAA, Becn1*'~ and 
Atgi6l1'™ mice suggest that cellular autophagy function is partially 
required for normal levels of exercise-induced muscle AMPK activa- 
tion. The mechanism underlying this requirement is not yet known, but 
is unlikely to be caused by non-specific effects of the mouse mutations 
on AMPK activation as the direct AMPK activator, the AMP analogue 
5-aminoimidazole-4-carboxamide riboside (AICAR), induced AMPK 
activation normally in BCL2 AAA, Becn1*’~ and Atg16l1"™ MEFs 
(Supplementary Fig. 16). 

We next examined whether exercise-induced autophagy is required 
for the beneficial metabolic effects of long-term exercise training, using a 
high-fat diet (HFD) model of obesity and impaired glucose tolerance*””. 
Age-matched cohorts of wild-type and BCL2 AAA mice were fed a HFD 


00 MONTH 2012 | VOL 000 | NATURE | 3 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


for four weeks; the HFD was continued for an additional 8 weeks either 
combined with 50 min of daily treadmill exercise or with no exercise 
(Supplementary Fig. 17a). In both wild-type and BCL2 AAA mice, HFD 
led to body weight gain (Supplementary Fig. 17b), which was probably 
due to increased fat mass (Supplementary Fig. 18a), as no differences in 
lean mass or muscle fibre size were observed (Supplementary Fig. 18). 
HED led to modest suppression of baseline autophagy in skeletal and 
cardiac muscle as measured by levels of LC3-II conversion and p62 in 
tissue lysates (Supplementary Fig. 19). Ultrastructural analyses of the 
liver and pancreas did not reveal any differences in HFD versus regular 
diet animals, in wild-type versus BCL2 AAA mice, or in non-exercised 
versus exercised animals, except for hepatic lipid droplet accumulation 
in all HFD-fed groups (Supplementary Fig. 20). Although BCL2 AAA 
mice were more sensitive to HFD-induced obesity, 8 weeks of daily 
exercise reduced the magnitude of weight gain similarly in BCL2 
AAA and wild-type mice (Supplementary Fig. 17b). Thus, the BCL2 
AAA mutation did not alter the response of mice to HFD with respect to 
muscle fibre size, the morphology of liver and pancreas, or the effect of 
exercise on HFD-induced obesity. 

However, BCL2 AAA mice failed to exhibit normal exercise- 
induced protection against HFD-induced impaired glucose tolerance. 
Before the administration of HFD, wild-type and BCL2 AAA mice had 
similar clearance rates in oral glucose tolerance tests (Fig. 4a). Four 
weeks of HFD led to impaired glucose tolerance in both genotypes 
versus regular diet control groups (Fig. 4b); the magnitude of this 
impairment was similar in BCL2 AAA and wild-type mice despite 
more weight gain in the BCL2 AAA mice. However, exercise training 
markedly improved glucose tolerance in HFD-fed wild-type mice but 
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Figure 4 | Long-term exercise training protects wild-type but not BCL2 
AAA mice from HFD-induced glucose intolerance. a, b, Oral glucose 
tolerance test (OGTT) before (a, week 0) and after (b, week 4) 4 weeks of HFD. 
c,d, OGTT (c) and serum leptin and adiponectin levels (d) after 8 weeks of daily 
exercise. For a-d, results represent the mean + s.e.m. for 4-5 mice per group. E, 
exercise; NE, no exercise; RD, regular diet .*P < 0.05, **P < 0.01, (c, one-way 
ANOVA; d, Wilcoxon rank test). NS, not significant. 
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not in autophagy-deficient BCL2 AAA mice (Fig. 4c and Supplemen- 
tary Fig. 21). This failure of BCL2 AAA mice to show improved glucose 
tolerance is unlikely to be due to deficient insulin production, because 
at the end of the study HFD-fed wild-type and BCL2 AAA mice dis- 
played similar circulating fasting insulin levels (Supplementary Fig. 22a), 
similar levels of insulin secretion in response to oral glucose challenge 
(Supplementary Fig. 22b), and similar pancreatic B-cell morphology 
(Supplementary Fig. 20b). These results suggest that BCL2-regulated 
functions are essential for chronic exercise-mediated protection against 
HFD-induced glucose intolerance. We speculate that, as in acute 
exercise, this may involve the dynamic interplay of autophagy induc- 
tion and AMPK activation. 

HFD-induced obesity is associated with several other metabolic 
changes, including increased levels of serum leptin, an appetite- 
inhibiting adipokine;” reduced levels of circulating adiponectin”', an 
antidiabetic adipokine*~*; and increased serum triglycerides and 
cholesterol. In HFD-fed mice, baseline levels of leptin were higher in 
BCL2 AAA mice than in wild-type mice, and 8 weeks of exercise 
decreased serum leptin in wild-type but not in BCL2 AAA mice 
(Fig. 4d). Although adiponectin levels in HFD-fed mice were similar 
in the non-exercised wild-type and BCL2 AAA groups, they were sig- 
nificantly increased in the exercised wild-type mice versus the exercised 
BCL2 AAA mice (Fig. 4d). Both serum triglycerides and cholesterol 
increased in wild-type and BCL2 AAA HFD-fed mice; with exercise, 
the values returned to those observed in animals fed a regular diet in 
wild-type but not BCL2 AAA mice (Supplementary Fig. 23). Thus, the 
BCL2 AAA mutation impaired the beneficial effects of exercise on 
metabolic changes induced by a HFD. 

The HFD-fed, daily exercised wild-type mice were more metabolically 
active than their BCL2 AAA counterparts, as demonstrated by elevated 
levels of oxygen consumption, CO, production and heat genera- 
tion during a 12h night interval when the mice were not exercised 
(Supplementary Fig. 24a). In addition, in vastus lateralis muscle, 
wild-type mice had elevated induction of messenger RNA for UCP1, 
Ucp1, a mitochondrial uncoupling protein mainly expressed in brown 
adipose tissue that contributes to thermogenesis and energy expend- 
iture’> (Supplementary Fig. 24d). These changes were probably due to 
improved metabolic fitness in response to exercise rather than altera- 
tions in food intake, spontaneous physical activity, or general 
mitochondrial function. No differences were observed in daily food 
intake in exercised HFD-fed wild-type and BCL2 AAA mice (Sup- 
plementary Fig. 24b). Spontaneous physical activity was similar in 
HFD-fed non-exercised wild-type and BCL2 AAA mice, although there 
was a trend towards decreased spontaneous physical activity in the 
exercised BCL2 AAA mice (Supplementary Fig. 24c). No changes were 
observed with exercise in either genotype in mRNA levels of the skeletal 
muscle-expressed uncoupling proteins UCP2 or UCP3, or the mito- 
chondrial proteins cytochrome b and c (Supplementary Fig. 24d, e). 

The HFD study suggests that increased autophagy triggered by 
exercise may be critical for improving impaired glucose tolerance 
and metabolism in diet-induced obesity. However, we cannot definitely 
conclude that lack of exercise-induced improvement in glucose 
tolerance in HFD-fed BCL2 AAA mice is caused by deficient exercise- 
induced autophagy; it is possible that other effects of the BCL2 AAA 
mutation are responsible for this phenotype. Nonetheless, given our 
findings in acute exercise (which demonstrate impaired muscle glucose 
uptake, GLUT4 plasma membrane localization and AMPK activation 
in autophagy-deficient animals), it seems plausible that alterations in 
exercise-induced skeletal muscle glucose metabolism in autophagy- 
deficient animals may also contribute to the failure of exercise to reverse 
HFD-induced metabolic abnormalities. 

Our findings demonstrate that exercise is a potent inducer of 
autophagy, and that acute and chronic exercise enhances glucose 
metabolism in mice capable of inducing autophagy but not in 
autophagy-deficient mice. These beneficial metabolic effects (as well 
as exercise- and starvation-induced autophagy) are blocked by a 
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mutation in BCL2 that prevents its release from an inhibitory inter- 
action with the autophagy protein beclin 1. Thus, BCL2 has previously 
undescribed essential roles in the in vivo regulation of stimulus- 
induced autophagy as well as glucose metabolism. We propose that 
BCL2-regulated autophagy activation contributes to the beneficial 
metabolic effects of exercise, and that manipulation of the autophagy 
pathway and/or the function of the autophagy inhibitory BCL2 protein 
may be a logical strategy to mimic the health effects of exercise and to 
prevent or treat impaired glucose metabolism. More broadly, on the 
basis of this newly discovered link between exercise, autophagy and 
altered metabolism, we speculate that autophagy may represent a 
cellular mechanism by which exercise prolongs life and protects 
against cancer, cardiovascular disorders and inflammatory diseases’. 


METHODS SUMMARY 

Mouse strains. GFP-LC3 transgenic’, Becn1 */~ (ref. 15) and Atgléll HM (ref. 17) 
mice have been described. The generation of mice with a knock-in mutation in 
Bcl2 that results in Thr69Ala, Ser70Ala and Ser84Ala mutations in BCL2 (BCL2 
AAA) is described in Methods. 

Exercise studies. Acute and chronic exercise studies were performed using a 
treadmill protocol described in Methods. 

Autophagy assays. Autophagy was measured by visualizing GFP-LC3 puncta by 
fluorescence microscopy in MEFs or tissue sections from mice with transgenic 
expression of GFP-LC3, or by western blot analysis of LC3-II conversion and p62 
levels in tissue lysates. 

Metabolic analyses. Metabolic and blood parameters in the acute and chronic 
exercise studies were measured as described in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mouse strains. GFP-LC3 transgenic, Becn1*’~ and Atgi6l1'™ mice have been 
previously described”’*’’. For the construction of a mouse strain with knock-in 
mutations in the phosphorylation sites in the non-structured loop of BCL2 
(Bcl2*““ mice), BAC clones (Incyte) were screened for the presence of Bcl2 using 
the following primers: BCL2 5’, GIGGGGCGGGAGTCGGGACT; BCL2 3’, 
GACCCAGAATCCACTCACAC. The Bcl2 BAC clone was digested by BglII, 
subcloned into pSP72, and a puromycin resistance marker flanked by FRT sites 
(SalI fragment of pPGKPuro, cloned into pFRT) was blunt cloned into a BsaBI site 
3’ of exon IL. A mutantallele was generated by subcloning a PstI fragment from the 
coding region. Amino acid residues Thr 69, Ser 70 and Ser 84 (homologous to 
human Ser 87) were changed to alanine in two steps using the Quickchange 
PCR (Stratagene) and the following primer sets: TSAA 5’, GAGATGGCTGCCA 
GGGCGGCTCCTCTCAGGCCCC; TSAA 3’, GGGGCCTGAGAGGAGCC 
GCCCTGGCAGCCATCTCG; $84A 5’, GCTGGGCCTGCGCTCGCCCC TGTGC 
CACCATG; S84A 3’, CATGGTGGCACAGGGGCGAGCGCAGGCCCAGC. 
The targeting construct 7.1AAA was electroporated into RW4 embryonic stem 
cells (129 Sv/J), and 36h later, clones were selected with puromycin, picked and 
screened by Southern blot analysis with the probes indicated in Supplementary 
Fig. 4a. A 1.7kb shift upwards indicative of the mutant allele was detected on 
Southern blots in 2/300 clones as the targeted locus (Supplementary Fig. 4b). 
These clones were tested for normal karyotype and used to inject blastocysts 
from C57BL/6J donors. Mice with germline transmission were bred to mice 
expressing FLP from the f-actin promoter (Jackson Laboratories: B6; 
SJLTgN(ACTFLPe)9205Dym), and offspring were screened for deletion of the 
puromycin selection marker. To screen for the presence of the mutation following 
excision of the selectable marker, PCR with 5’ and 3’ Bcl2 primers was performed, 
and digestions were performed with Bgll or AatII enzymes; Bgl] cuts the amplified 
fragment only if the T69AS70A site is present, while AatlII cuts only the wild-type 
sequence for the same region (Supplementary Fig. 4c). Using this scheme, 
homozygous Bcl2*** mutants were identified by the presence of PCR fragments 
cleaved only by Bgll and not AatII, whereas wild-type mice were identified by the 
presence of PCR fragments cleaved only by AatII and not Bgll. Bcl2**“ mice were 
backcrossed for more than ten generations to C57/B6 mice (Jackson Laboratories), 
and homozygous Bcl2“*“ and Bcl2" offspring were used in all studies. 
Cell lines. Primary MEFs were established either from Bcl2**“ and Bcl2“* mice, 
the offspring of Bcl2*** and Bcl2" mice crossed with GFP-LC3 transgenic mice, 
Becn1*'~ and Becn1*'* mice, and Atgi6l1'™ and Atgi6l1" at embryonic day 
13.5 and cultured as described”. 
Radiolabelling and co-immunoprecipitation from MEFs. MEFs derived from 
Bcl2“*4 and Bcl2“' mice were cultured overnight in labelling medium (phosphate- 
free DMEM with 10% dialysed FBS) and then cultured in starvation medium (EBSS 
minus phosphate) or labelling medium for 4h containing 2mCiml' *?P- 
orthophosphate. Cells were lysed in lysis buffer containing 50 mM Tris (pH 7.9), 
150 mM NaCl, 1 mM EDTA, 1% Triton X-100, proteinase inhibitor cocktail (Roche 
Applied Sciences) and halt phosphatase inhibitor cocktail (Thermo Scientific), and 
were subjected to immunoprecipitation with a monoclonal anti-BCL2 antibody 
(Santa Cruz Biotechnology, 1:50). Eluates were separated by SDS-PAGE and 
detected by anti-beclin 1 antibody (Santa Cruz Biotechnology, 1:200), anti- 
BCL2-HRP antibody (C2, Santa Cruz Biotechnology, 1:100) and autoradiography. 
Co-immunoprecipitations from muscle tissue. Vastus lateralis (thigh muscle) 
was dissected and homogenized in lysis buffer containing 50 mM Tris (pH 7.9), 
150mM NaCl, 1mM EDTA, 1% Triton X-100, proteinase inhibitor cocktail 
(Roche Applied Sciences) and halt phosphatase inhibitor cocktail (Thermo 
Scientific), and subjected to immunoprecipitation with a monoclonal anti-BCL2 
antibody (Santa Cruz Biotechnology, 1:50). Eluates were separated by SDS-PAGE 
and detected by anti-beclin 1 antibody (Santa Cruz Biotechnology, 1:200) and anti- 
BCL2-HRP antibody (C2, Santa Cruz Biotechnology, 1:100) and autoradiography. 
Mouse exercise studies. For acute exercise studies, 8-week-old (wild type, BCL2 
AAA and Atgl 611™™) or 12-week-old (Becn1*/~ and wild-type Becn1*/* 
littermate) mice were acclimated to and trained on a 10° uphill Exer 3/6 open 
treadmill (Columbus Instruments) for 2 days. On day 1 mice ran for 5 min at 
8mmin ' and on day 2 mice ran for 5 min at 8mmin_ | followed by another 
5 min at 10mmin *. On day 3, mice were subjected to a single bout of running 
starting at the speed of 10mmin_'. Forty minutes later, the treadmill speed was 
increased at a rate of 1!mmin ! every 10 min for a total of 30 min, and then 
increased at the rate of l1mmin | every 5min until mice were exhausted. 
Exhaustion was defined as the point at which mice spent more than 5s on the 
electric shocker without attempting to resume running. Total running time was 
recorded and total running distance was calculated for each mouse. 

For long-term exercise training, male Bcl2“" or Bcl2*“* mice (8 weeks old) 
were randomly divided into three cohorts, including: (1) mice fed a regular diet 
(Harlan Teklad) without daily exercise, (2) mice fed a HFD containing 60% fat 


(Research Diets) without daily exercise, and (3) mice fed a HFD with daily exercise. 
Prior to initiation of exercise, mice were fed a HED for 4 weeks. At the end of the 
fourth week, mice in the exercise groups were acclimated to treadmill running for 2 
days as in the acute exercise studies, and then were trained on the treadmill with 
10° uphill incline for 50 mind” ', 5 d week! at 17m min‘ for 8 weeks. Mice were 
given a HFD during the 8-week training period. All animal procedures were 
performed in accordance with institutional guidelines and with approval from 
the Institutional Animal Care and Use Committee. 

Autophagy analyses. MEFs expressing GFP-LC3 were cultured in normal or 
starvation (EBSS, Earle’s balanced salt solution) medium for 4h, and GFP-LC3 
puncta were quantified by fluorescence microscopy as described’. For assessment 
of autophagy in vivo following starvation or exercise, 8-week-old Bcl2 or Bcl2**“ 
GFP-LC3 mice or Becn1*’* or Becn1*'~ GFP-LC3 mice were either subjected to 
starvation for 48h or exercised for the indicated time period, anaesthetized by 
isoflurane, and perfused with 4% paraformaldehyde (PFA). Skeletal muscle (vastus 
lateralis, tibialis anterior, extensor digitorum longus and soleus), heart (left 
ventricle), pancreas and liver tissues were fixed in 4% PFA overnight, 15% sucrose 
for 4h and 30% sucrose overnight before frozen sections were prepared. The 
number of GFP-LC3 puncta per unit area of tissue was quantified by fluorescence 
microscopy as described’’. Autophagy in skeletal and cardiac muscle, liver, 
pancreas and adipose tissue during baseline conditions and after exercise was also 
analysed by western blot analysis of tissue extracts with antibodies against LC3 and 
p62 (see below for details). 

Immunofluorescence studies. For immunofluorescence staining of frozen muscle 
and pancreatic sections, slides were heated at 50 °C for 10 min, rehydrated in 100%, 
95% and 70% ethanol, washed 3 X 5 min in PBS, blocked with 1% goat serum and 
immunostained with a rabbit anti-mouse GLUT4 antibody (Alpha Diagnostic, 
1:100 dilution) (muscle samples) and an Alexa Fluor 594 donkey anti-rabbit 
secondary antibody (Invitrogen, 1:1,000 dilution) or with a guinea pig anti-swine 
insulin antibody (Dako, 1:500 dilution) (pancreatic samples) and an Alexa Fluor 
594 goat anti-guinea pig secondary antibody (Invitrogen, 1: 500 dilution). 
Immunofluorescence studies on skeletal muscle sections with anti-laminin and 
anti-type I myosin heavy chain antibodies were performed as previously 
described’*. Immunofluorescence images were taken using a Zeiss Axioplan2 
microscope or Leica TCS SP5 confocal microscope. 

Muscle morphology and function studies. Sections were stained with haematoxylin 
and eosin to visualize tissue architecture following standard protocols. 
Metachromatic ATPase staining and measurement of fibre cross-sectional area 
were performed as previously described”*. Periodic acid-Schiff staining for carbo- 
hydrates was performed according to standard protocols”. Succinic dehydrogenase 
staining on cryosections was carried out using a 0.2 M phosphate buffer at pH 7.6. 
For staining, 270 mg succinic acid and 10 mg nitro blue tetrazolium were freshly 
dissolved in 10 ml phosphate buffer and incubated with sections for 1h. 

To measure muscle strength, a modified mesh grip assay was applied”°. Briefly, 
mice were placed on the wired mesh of cage tops. When they firmly grabbed the 
mesh, the cage top was flipped and held at a height of 30 cm above the bench until 
the mice released their grip. The time for each mouse to remain on the mesh was 
recorded. TUNEL staining of muscle sections to detect apoptosis was performed 
according to the manufacturer’s instructions (ApopTag Peroxidase In situ 
Apoptosis Detection Kit, Millipore), using Sigma FAST 3,3'-diaminobenzidine 
(DAB) tablets as the peroxidase substrate. 

Electron microscopy. Mice were euthanized, and liver and pancreas were rapidly 
fixed in 2.5% glutaraldehyde in 0.1 M cacodylate buffer. Electron microscopy was 
performed as previously described*'. 

Mouse metabolic studies and blood tests. Food intake, O, consumption, CO, 
production, heat production and physical activity were measured in TSE 
metabolic chambers (TSE Systems) over a 4 day period. Mice in the exercise group 
were removed from the chambers and allowed to run for 50 min daily as in the 
above protocol during the day (light period). Metabolic measurements were per- 
formed for 12h daily during the dark period. Fat mass and lean tissue mass were 
determined using the Bruker Minispec mq10 scan system. Plasma glucose, lactate, 
cholesterol and triglycerides were measured using the Vitros 250 system (Ortho- 
Clinical Diagnostics). Plasma insulin (Crystal Chem), serum leptin (Crystal 
Chem) and adiponectin (Millipore) levels were determined using commercial 
ELISA kits. Glucose tolerance tests were performed by oral glucose injection 
a gkg') after a 3h fast. Blood was drawn from tail veins at the indicated time 
points after glucose injection and serum glucose levels were analysed using 
commercial glucose assay reagents (Sigma, Cat# 6918, P7119, F5803). 
Echocardiography. Cardiac function was assessed by echocardiography, using a 
Visualsonic Vevo 2100 ultrasound machine equipped with a 30-Mhz transducer 
applied to the chest wall. Ventricular dimensions and ejection fraction were 
assessed using short axis view in two-dimensional and three-dimensional modes 
using the onboard VisualSonics cardiac analysis package. 
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Muscle glucose uptake assays. Surgical catheterization of the jugular vein was 
performed on each mouse. At day 4 after catheterization, mice were transferred to 
clean cages at 9:00 a.m. to begin a 4h fast. Mice then ran on a treadmill for 50 min 
using the exercise protocol described above. At t = 50 min, a 13 1Ci bolus of Ce 
deoxyglucose was injected intravenously and mice resumed running for another 
25min according to the exercise protocol described above before euthanasia. 
Skeletal muscle (soleus) and brain tissues were dissected and homogenized in 
1.5 ml 0.5% perchloric acid. '*C-deoxyglucose counts were determined in homo- 
genized samples as previously described” and muscle values were normalized to 
the counts in the brain. 

Western blot analyses. Mouse tissue extracts were prepared by homogenizing 
tissues in lysis buffer containing 50mM Tris (pH 7.9), 150mM NaCl, 1mM 
EDTA, 1% Triton X-100, proteinase inhibitor cocktail (Roche Applied Sciences) 
and halt phosphatase inhibitor cocktail (Pierce), and subjected to western blot 
analysis with anti-LC3 (Novus Biologicals, 1:500), anti-p62 (BD Biosciences, 1:500 
dilution; Progen (C-terminal specific), 1:1,000 dilution). Anti-p-CaMKI (Santa 
Cruz Biotechnology, 1:100), anti-CaMKI (Santa Cruz Biotechnology, 1:300), 
anti-BCL2 (Santa Cruz Biotechnology, 1: 100), anti-AMPK (Cell Signaling, 
1:1,000), anti-p-AMPK (Cell Signaling, 1:1,000), anti-JNK (Cell Signaling, 
1:400), anti-p-JNK (Cell Signaling, 1:500) anti-AS160 (Millipore, 1:400) anti-p- 
AS160 (Millipore, 1:100), anti-4E-BP1 (Cell Signaling, 1:1,000), anti-p-4E-BP1 
(Cell Signaling, 1:1,000), anti-p-ACC (Millipore, 1:500) anti-AKT (Cell 
Signaling, 1:600), anti-p-AKT (Cell Signaling, 1:500), anti-p38 (Cell Signaling, 
1:500), anti-p-p38 (Cell Signaling, 1:400), anti-atrogin-1 (ECM Biosciences, 
1:400), ATG16L1 (Sigma, 1:1,000) and anti-actin (Santa Cruz Biotechnology, 
1:3,000) antibodies. 

For in vitro assessment of AMPK activation, MEFS derived from Bcl. AA and 
Bcl2" mice, Becn1*/~ and Benc1‘'* mice, and Atg16l1'™ and AtgI6l1" mice 
were treated with 2 mM of the AMP analogue AICAR, or DMSO vehicle for 2h. 
Whole-cell lysates were prepared, and AMPK and phospho-AMPK levels were 
determined by western blot analysis. For quantification, the relative intensity 
values of western blot bands were normalized to that of the first lane (set as 1) 
in the wild-type non-exercised group. 

Statistical analyses. ANOVA approaches were used to compare values among 
different experimental groups for data that met the normality assumption. One- 
way ANOVA was used for comparison between two groups. Two-way ANOVA 
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was used for comparison of the magnitude of changes between two different 
groups in mice of two different genotypes. The normality assumption for the 
ANOVA model was checked using residual plots. When the assumption was 
violated, the data were log-transformed in order to meet the assumption. For data 
sets in which log transformation was inadequate to meet the analysis assumption, 
the non-parametric Wilcoxon rank-sum test was used. 

Real-time PCR analyses. RNA isolation from muscle tissues and real-time RT- 
PCR were performed as previously described’*. The following Sybr Green primers 
were used: f-actin: forward, CTGGCTCCTAGCACCATGAAGAT; reverse, 
GGTGGACAGTGAGGCCAGGAT; UCP1: forward, TCAGGATTGGCCTCTA 
CGAG; reverse, TAAGCCGGCTGAGATCTTGT; UCP2: forward, T@CCCGTA 
ATGCCATTGTC; reverse, AGTGGCAAGGGAGGTCATCT; cytochrome c: 
forward, GI'CTGTTCGGGCGGAAGACAG; reverse. GGGGAGAGGATAC 
CCTGATGG; and cytochrome b: forward, ATTCATTGACCTACCTGCCC; 
reverse, TCTGATGTGTAGTGTATGGC. The following Tagman primers were 
used: PPARGCla (PGC1a): Mm00447183_m1; myoglobin: Mm00442968_m1; 
PPARGCI1B (PGC1B): Mm01258518_ml; MYH7 (MHC Type JD: 
Mm00600555_ml; MYH1 (MHC Type IId/x): Mm01332488_g1; MYH2 (MHC 
Type Ila): Mm01332564_m1; and GAPDH: Mm99999915_g1. 
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Complete subunit architecture of the 
proteasome regulatory particle 


Gabriel C. Lander’, Eric Estrin?*, Mary E. Matyskiela?*, Charlene Bashore?, Eva Nogales’** & Andreas Martin?* 


The proteasome is the major ATP-dependent protease in eukaryotic cells, but limited structural information restricts a 
mechanistic understanding of its activities. The proteasome regulatory particle, consisting of the lid and base 
subcomplexes, recognizes and processes polyubiquitinated substrates. Here we used electron microscopy and a new 
heterologous expression system for the lid to delineate the complete subunit architecture of the regulatory particle from 
yeast. Our studies reveal the spatial arrangement of ubiquitin receptors, deubiquitinating enzymes and the protein 
unfolding machinery at subnanometre resolution, outlining the substrate’s path to degradation. Unexpectedly, the 
ATPase subunits within the base unfoldase are arranged in a spiral staircase, providing insight into potential 
mechanisms for substrate translocation through the central pore. Large conformational rearrangements of the lid 
upon holoenzyme formation suggest allosteric regulation of deubiquitination. We provide a structural basis for the 
ability of the proteasome to degrade a diverse set of substrates and thus regulate vital cellular processes. 


The ubiquitin-proteasome system is the major pathway for selective 
protein degradation in eukaryotic cells. Covalent modification with a 
polyubiquitin chain targets damaged, misfolded and short-lived regu- 
latory proteins for ATP-dependent destruction by the 26S proteasome, 
a massive 1.5 MDa proteolytic machine. The proteasome thus controls 
a myriad of essential cellular processes, including the cell cycle, tran- 
scription and protein quality control’. Despite intensive study, however, 
the structural basis for substrate recognition and processing by the 
proteasome remains poorly understood. 

The proteasome contains at least 32 different subunits that form a 
barrel-shaped 20S proteolytic core capped on either end by a 19S 
regulatory particle. The active sites of the peptidase are sequestered 
in an internal chamber, and access is controlled by the regulatory 
particle, which functions in substrate recognition, deubiquitination, 
unfolding and translocation of the unfolded chains into the core”. 

The regulatory particle is composed of 19 subunits and can be 
divided into two subcomplexes, the lid and the base. The lid consists 
of nine non-ATPase proteins (Rpn3, Rpn5- Rpn9, Rpn11, Rpn12 and 
Sem] in yeast), including the deubiquitinating enzyme (DUB) Rpn11, 
whose activity is essential for efficient substrate degradation®’. The 
base contains six distinct AAA+ ATPases, Rptl-Rpt6, that form a 
hetero-hexameric ring (in the order Rptl, Rpt2, Rpt6, Rpt3, Rpt4, 
Rpt5; ref. 8) and constitute the molecular motor of the proteasome. 
The ATPases are predicted to use the energy of ATP binding and 
hydrolysis to exert a pulling force on substrate proteins, unfold them, 
and translocate the polypeptides through a narrow central pore into 
the peptidase chamber. In the presence of ATP, the carboxy termini of 
the ATPases bind dedicated sites on the o-subunit ring («1-«7) of 
the 20S core, triggering the opening of a gated access channel and 
facilitating substrate entry°’"’. Besides Rptl-Rpt6, the base contains 
four non-ATPase subunits: Rpnl, Rpn2 and the ubiquitin receptors 
Rpnl0 and Rpnl3. Additional ubiquitin shuttle receptors (Rad23, 
Ddil and Dsk2) are recruited to the base through interactions with 
Rpnl, which also binds a second, non-essential DUB, Ubp6 (refs 12-14). 

Whereas the proteolytic core has been well studied, there is only 
limited structural characterization of the regulatory particle'’’”. 


None of the 13 non-ATPase subunits, including the ubiquitin receptors 
and deubiquitinating enzymes, have been localized within this assembly. 
Although it has been shown that efficient degradation depends on the 
length, linkage type and placement of an ubiquitin chain, as well as 
the presence of an unstructured initiation site on a substrate*"*"’, we 
are missing the topological information needed to explain these 
requirements. Thus, elucidating the architecture of the regulatory 
particle and the spatial arrangement of individual subunits is crucial 
to understanding the molecular mechanisms for substrate recognition 
and processing. 

Here, we present the electron microscopy structure of the proteasome 
holoenzyme and the lid subcomplex. A new heterologous expression 
system for the lid facilitated the localization of all subunits within the 
regulatory particle, providing a complete architectural picture of the 
proteasome. The resulting structural understanding offers novel insight 
into the mechanisms of ubiquitin binding, deubiquitination, substrate 
unfolding and translocation by this major eukaryotic proteolytic 
machine. 


Recombinant expression of yeast lid in Escherichia coli 


We developed a system for the heterologous coexpression of all nine 
lid subunits from Saccharomyces cerevisiae in Escherichia coli. This 
system allowed us to generate truncations, deletions and fusion con- 
structs that were used to localize individual subunits and delineate 
their boundaries within the lid. The recombinant, purified lid was 
analysed in its subunit composition and stoichiometry by SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE; Supplementary 
Figs 1 and 2) and tandem mass spectrometry. The small, non-essential 
subunit Sem1 could not be detected, neither for the recombinant nor 
the endogenous lid that was isolated from yeast. All other subunits 
were present with the expected stoichiometry, and gel-filtration ana- 
lyses showed indistinguishable elution profiles for the heterologously 
expressed lid and its endogenous counterpart (data not shown). 
Furthermore, atomic emission spectroscopy confirmed that the 
essential Zn** ion was incorporated in Rpnll, indicating proper 
folding in E. coli. 


1Life Sciences Division, Lawrence Berkeley National Laboratory, University of California, Berkeley, California 94720, USA. @Department of Molecular and Cell Biology, University of California, Berkeley, 
California 94720, USA. 2Howard Hughes Medical Institute, University of California, Berkeley, California 94720, USA. 4QB3 Institute, University of California, Berkeley, California 94720, USA. 
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To compare the functionalities of recombinant and endogenous lid, 
we established conditions for their in vitro reconstitution with base and 
20S core subcomplexes from yeast to yield 26S holoenzyme. These 
reassembled particles were assayed for their activity in ubiquitin- 
dependent substrate degradation by using a polyubiquitinated green 
fluorescent protein (GFP)-cyclin fusion protein and following the 
decrease in GFP fluorescence. Proteasome reconstituted with E. coli- 
expressed lid supported robust substrate degradation (Supplementary 
Fig. 3). Importantly, the three-dimensional electron microscopy 
reconstructions from negative-stained samples of both lid sub- 
complexes are practically identical (Fig. la and Supplementary Fig. 4), 
establishing this recombinant system as an ideal tool for our structural 
studies of the regulatory particle. 


Localization of regulatory particle subunits 


Asa first step in elucidating the architecture of the regulatory particle, 
we compared the single-particle electron microscopy reconstructions 
of the yeast holoenzyme and the isolated lid subcomplex obtained at 
9- and 15-A resolution, respectively (Fig. 1b, Supplementary Figs 5-7 
and Supplementary Movie 1). Docking the five-lobed, hand-shaped 
structure of the lid into the electron density of the holoenzyme 
revealed the lid’s position on one side of the regulatory particle, form- 
ing extensive interactions with the base subcomplex, but also contact- 
ing the 20S core. The lid subunits Rpn3, Rpn5, Rpn6, Rpn7, Rpn9 and 
Rpnl2 contain a C-terminal PCI (Proteasome-CSN-eIF3) domain 
that is assumed to have scaffolding functions and allow inter-subunit 
contacts’. Our reconstruction provided sufficient resolution to un- 
ambiguously locate the winged-helix fold and the flanking helical 
segments of individual PCIs (Fig. 1c and Supplementary Movie 1). 
The C-terminal PCI domains of the six Rpn subunits thus interact 
laterally to form a horseshoe-shaped anchor from which the amino- 
terminal domains extend radially. This arrangement demonstrates 


o Endogenous lid 


Recombinant lid 
his ‘ 


> 


a> 


Figure 1 | The lid subcomplex within the holoenzyme assembly. a, Negative- 
stain three-dimensional reconstruction at approximately 15-A resolution 
shows resemblance between endogenous (left) and recombinant (right) lid. 
b, Locations of lid (yellow) and base (cyan) within the subnanometre 
holoenzyme reconstruction. ¢, Six copies of the crystal structure of a PCI 
domain (PDB ID: 1RZA4) are docked into the lid electron density, showing a 
horseshoe-shaped arrangement of the winged-helix domains. Each domain is 
coloured according to its respective lid subunit (Fig. 2). 
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the scaffolding function of PCI domains in the lid, and we predict 
that similar interactions underlie the architecture of other PCI- 
containing complexes. 

To determine the subunit topology of the lid, we used our hetero- 
logous E. coli expression system, fused maltose-binding protein 
(MBP) to the N or C terminus of individual subunits (Supplemen- 
tary Fig. 1), and localized the MBP within the tagged lid particles by 
negative-stain electron microscopy (Supplementary Fig. 8a). None of 
the MBP fusions notably affected the lid structure, and we were able to 
identify the positions of all eight essential lid subunits and the relative 
orientation of their N and C termini. In combination with the PCI 
docking, the resolution of secondary structures in the cryoelectron 
density and known molecular weights, this information allowed us to 
delineate approximate subunit boundaries (Fig. 2a and Supplemen- 
tary Movie 1). 

Overall, Rpn3, Rpn7, Rpn6, Rpn5 and Rpn9 form the fingers of the 
hand-shaped lid structure. Rpn8 shows an extended conformation 
that connects Rpn3 and Rpn9, and thus closes the PCI horseshoe. 
In addition, it interacts with Rpn11, the only essential DUB of the 
proteasome, which lies in the palm of the hand and makes extensive 
contacts with Rpn8, Rpn9 and Rpn5. 

Using the topology determined for the isolated lid subcomplex, we 
delineated the individual lid subunits in the context of the holoenzyme 
(Fig. 2b). To complete the subunit assignment for the entire regulatory 
particle, the positions of Rptl-Rpt6 in the base subcomplex were 
assigned according to established interactions with the core particle’*”®, 
whose crystal structure could be docked unambiguously into the elec- 
tron microscopy density (Supplementary Fig. 9). We localized the 
two large non-ATPases Rpnl and Rpn2 of the base subcomplex by 
antibody-labelling of a C-terminal Flag tag and N-terminal fusion of 
glutathione-S-transferase (GST), respectively (Supplementary Figs 2 
and 10a-c). Rpnl and Rpn2 had been predicted to contain numerous 
tetratricopeptide repeat (TPR)-like motifs and adopt «-solenoid struc- 
tures*’. Indeed, we found a high structural resemblance between Rpn1 
and Rpn2, both consisting of a strongly curled solenoid that transitions 
into an extended arm towards the C terminus (Fig. 3a). Rpn1 contacts 
the C-terminal helix of the 20S core subunit «4 and, based on the 
variability observed in our electron microscopy images, is likely to be 
flexible or loosely attached to the side of the base. Previous crystal- 
lography studies of the archaeal proteasome homologue PAN revealed 
that the N-terminal domains of the ATPases form a separate hexameric 
ring (N-ring) that consists of OB domains and three protruding coiled- 
coil segments'’**. Each coiled coil is formed by the far N-terminal 
residues of two neighbouring ATPases in the hexamer. Although 
Rptl and Rpt2 do not seem to form an extended coiled coil, we find 
that the N-terminal helical portion of Rpt1 interacts with the solenoid 
and the C-terminal arm of Rpn1. Rpn2 is located above the N-ring and 
mounted atop the longest of the protruding coiled coils, formed by Rpt3 
and Rpt6. These interactions strongly resemble those observed between 
Rptl and Rpn1 (Fig. 3a). 

Localizing the ubiquitin receptors and DUBs within the regulatory 
particle is of particular interest. In addition to the DUB Rpn11 in the 
lid, we identified the positions of both intrinsic ubiquitin receptors, 
Rpn10 and Rpn13, and of the base-associated DUB Ubp6 by imaging 
proteasome particles from yeast deletion strains (Fig. 3b and Sup- 
plementary Fig. 10d-f). The ubiquitin receptor Rpn13 binds to 
Rpn2 as expected****. The globular VW A domain of the second receptor 
Rpnl0 has been shown previously to stabilize the lid-base inter- 
action’>”*; however, we found that it does not contact the base directly. 
This domain bridges Rpn11 and Rpn9, which might increase the lid- 
base affinity indirectly by stabilizing Rpnl11 in its Rpn2-bound con- 
formation (see below). The flexibly attached ubiquitin interacting motif 
(UIM) of Rpn10 probably contacts the coiled coil formed by Rpt4 and 
Rpt5, stabilizing its position relative to other subunits and potentially 
communicating with the AAA+ motor. The DUB Ubp6 seems to be 
flexible and does not give rise to ordered density. Nonetheless, variance 
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Figure 2 | Three-dimensional reconstructions of the recombinant lid 
subcomplex and the yeast 26S proteasome. a, Negative-stain reconstruction 
of the isolated lid subcomplex at 15-A resolution, coloured by subunit and 
shown from the exterior (left), the side (middle) and the interior, base-facing 
side (right). A dotted line (middle) indicates the highly variable electron density 


maps indicate that it interacts with the C-terminal arm of Rpn1, as 
suggested by immunoprecipitations™. 


Inter-subcomplex contacts 

The complete localization of subunits within the holoenzyme revealed 
unexpected contacts between the lid and core subcomplexes. Rpn5 
and Rpné form fingers that touch the C termini of the core subunits 
a1 and «2, respectively. We confirmed the interaction between Rpn6 
and «2 by in vitro crosslinking, using an engineered cysteine in «2 and 
a 7-A heterobifunctional crosslinker (Supplementary Fig. 11). These 
previously unknown direct interactions between lid and core may 
stabilize the entire holoenzyme assembly, and/or be part of an allosteric 
network that modulates the activities of either subcomplex. 


Figure 3 | Localization of Rpn1 and Rpn2, and ubiquitin-interacting 
subunits. a, Rpnl1 (top) and Rpn2 (bottom) are oriented to emphasize 
similarities in their domain structure and solenoid attachment to the extended 
N-terminal helices of Rptl and Rpt3/Rpt6, respectively. b, Side and top views of 
the regulatory particle, showing the locations of the ubiquitin receptors Rpn10 


~ 
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Rpn11 
C terminus 


for the flexible N-terminal domains of Rpn5 and Rpn11. b, Subnanometre 
cryoelectron microscopy reconstruction of the holoenzyme, shown in three 
views corresponding to the isolated lid and coloured as above, with the core 
particle in grey. 


Our holoenzyme structure shows that Rpn3, Rpn7, Rpn8 and 
Rpnll make extensive contacts with the base. Compared to their 
positions in the isolated lid, Rpn8 and Rpn11 have undergone signifi- 
cant conformational changes in the holoenzyme (Fig. 4). The C ter- 
minus of Rpn8 is detached from Rpn3 to interact with the coiled coil 
of Rpt3/Rpt6, while the N-terminal MPN domain of Rpn11 extends 
towards the centre of the regulatory particle to bind the solenoid 
portion of Rpn2. Similarly, the N-terminal region of Rpn3 is more 
elongated than in the isolated lid and also contacts the Rpn2 solenoid, 
but from the opposite side. In turn, the extended C-terminal arm of 
Rpn2 interacts with Rpn3 and Rpn12, and thus forms a direct con- 
nection between the solenoid section of Rpn2, the coiled coil of Rpt3/ 
Rpt6, and the lid (Fig. 3b). 


and Rpn13, and the DUB Rpn11 relative to the central pore. Crystal structures 
for Rpn10 (PDB ID: 2X5N), Rpn13 (PDB ID: 2R2Y), and an MPN domain 
homologous to Rpn11 (AMSH-LP, PDB ID: 2ZNR) are shown docked into the 
electron microscopy density. The predicted active site of Rpn11 is indicated 
(red dot). 


00 MONTH 2012 | VOL 000 | NATURE | 3 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


a Isolated lid 


Integrated lid 


( 


Rpn5d 
N terminus 


Figure 4 | Conformational rearrangements of the lid subcomplex upon 
integration into the holoenzyme. a, b, The lid complex in its isolated (left) and 
integrated (right) state is shown as viewed from the exterior (a) and top (b) of 
the regulatory particle. Major subunit rearrangements are depicted by arrows. 
The N terminus of Rpn5 (light yellow) interacts with Rpn11 in the isolated 
complex, and swings down to contact the core particle upon incorporation into 
the holoenzyme. The N-terminal domain of Rpn6 swings to the left to interact 
similarly with the core particle. Rpn3, Rpn8 and Rpn11 undergo notable 
rearrangements, in which they move towards the centre of the regulatory 
particle. 


We speculate that Rpn2 stabilizes a lid conformation in which 
Rpn3, Rpn8 and the DUB Rpn11 extend towards the base (Fig. 4b). 
Together, the lid, Rpn2 and the coiled coils of the N-ring seem to 
function asa scaffold that positions the two intrinsic ubiquitin receptors 
Rpnl0 and Rpni13, and the DUB Rpnll for substrate binding, 
deubiquitination and transfer to the subjacent central pore of the 
AAA+ motor (Fig. 3b). Interestingly, several lid subunits interact 
directly with AAA+ domains of the Rpt subunits. Rpn7 contacts the 
AAA+ domains of Rpt2 and Rpt6, while Rpn6é and Rpn5 touch Rpt3. 
These interactions with contiguous motor domains are surprising, 
because current models for ATP-dependent unfoldases suggest signifi- 
cant conformational changes of individual subunits in the hexamer 
during ATP hydrolysis and substrate translocation’”~’. The observed 
contacts between lid and the motor domains might form only transi- 
ently; alternatively, the AAA+ ring of the proteasome may be much 
more static than previously assumed. 


Lid conformational changes may regulate DUB activity 
Comparing the structures of the lid in isolation and when bound to 
holoenzyme revealed major conformational changes that suggest an 
allosteric mechanism for the regulation of Rpnll DUB activity 
(Fig. 4). In the isolated lid, the N-terminal MPN domain of Rpn11 
forms extensive interactions with Rpn9 and the curled up Rpn5 finger. 
Upon lid binding to the holoenzyme, this Rpn5 finger swings down to 
contact the «1 subunit of the 20S core and thereby releases Rpn11, 
which then extends towards the Rpn2 solenoid. Docking the MPN 
domain of a related DUB (PDB ID: 2ZNR) into the electron density of 
Rpn11 indicates the approximate location of the active site (Fig. 3b). 
The interactions of Rpn11 with Rpn9 and Rpn5 in the free lid probably 
restrict access to this active site, which would prevent futile substrate 
deubiquitination in the absence of base and 20S core, and explain 
previous observations that the lid subcomplex has DUB activity only 
within the holoenzyme’ (and our unpublished data). 


4 | NATURE | VOL 000 | 00 MONTH 2012 


Functional asymmetry in the AAA+ unfoldase 
Our subnanometre structure of the holoenzyme provides new insights 
into the architecture and potential mechanisms of the base AAA+ 
unfoldase. As suggested by previous electron microscopy studies'*"®, 
the ring of the base and the 20S core are slightly offset from a coaxial 
alignment, with the base shifted by approximately 10 A towards the 
lid (Fig. 5a). Despite or perhaps because of this offset, the C-terminal 
tails of Rpt2, Rpt3 and Rpt5 are docked into their cognate 20S binding 
pockets at the interfaces of the subunits «3 and 04, «1 and «2, and 05 
and «6, respectively. Those three Rpt tails contain the terminal HbYX 
motif, which is critical for triggering gate opening in the 20S core*”°, 
and indeed our structure is consistent with an open-gate conforma- 
tion. The tails of Rptl, Rpt4 and Rpt6 lack this motif and were not 
observed to interact statically with 20S in our holoenzyme structure. 
Current mechanistic models for AAA+ unfoldases predict that 
ATPase subunits in the hexamer are in different nucleotide states 
and undergo significant conformational changes driven by coordi- 
nated ATP hydrolysis’”*°*!. Because we determined the structure of 
wild-type proteasome in the presence of saturating ATP, we expected 
that different complexes would have any given Rpt subunit in different 
conformations, leading to reduced electron density or low resolution 
when averaging thousands of these unsynchronized motors. However, 
our reconstruction shows highly ordered density throughout the 
AAA-+ domains ofall six Rpt subunits. Whereas the C-terminal ‘small 
AAA~+’ subdomains (except for Rpt6) arrange in one plane above the 
20S core, the ‘large AAA+’ subdomains of Rpt1-Rpt5 are oriented ina 
spiral staircase around the hexameric ring, with Rpt3 at the highest and 
Rpt2 at the lowest position (Fig. 5b and Supplementary Movie 1). The 
AAA+ domain of Rpt6 adopts a tilted orientation, bridging Rpt2 and 
Rpt3. Similar staircase arrangements have been observed previously 
for helicases of the AAA+ and RecA superfamilies**”**. It was suggested 
that during ATP hydrolysis, individual subunits progress through the 
different conformational stages of the staircase, thereby translocating 
substrate through the pore. The particular staircase orientation we 
observed identically for all proteasome particles may represent a 
low-energy state of the base, adopted under our experimental condi- 
tions. Alternatively, this staircase arrangement of Rptl-Rpt6 may be 
static and reflect the functional state of the base, in which substrates are 
translocated by local motions of the pore loops while the relative posi- 
tions of the motor subunits remain fixed. Future biochemical and 
structural studies will be required to distinguish between these two 
models. 


Figure 5 | Structural features of the base ATPase subunits. a, Positions of 
Rpt2 (cyan), Rpt3 (green) and Rpt5 (orange) within the base hexameric ring 
and relative to the 20S core (grey) are shown using fitted crystal structures of the 
homologous PAN AAA+ domain (PDB ID: 3H4M). The electron microscopy 
density contains the molecular envelope of the C-terminal tails (dark blue), 
docked into their cognate binding sites on the 20S core. Corresponding 
densities were not found for the tails of Rptl, Rpt4 and Rpt6 (grey ribbon 
structure). b, Cutaway side view of the holoenzyme electron microscopy 
density with Rptl—Rpt5 visible. Individually docked copies of the PAN crystal 
structure reveal a spiral staircase arrangement of the Rpt subunits, emphasized 
by space-filling representations of the PAN pore-1 loop residues (not resolved 
in the Rpt subunits). 
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Spatial arrangement of ubiquitin receptors and DUBs 


Localizing all subunits of the regulatory particle enabled us to infer the 
requirements and potential mechanisms for the recognition and 
degradation of ubiquitin-tagged substrates (Fig. 6). After a substrate 
binds to an ubiquitin receptor, its polyubiquitin chain must be 
removed by Rpn11 cleavage at the proximal ubiquitin to permit sub- 
sequent fast degradation®’. To allow cleavage without disengaging 
from the receptor, an ubiquitin chain must be long enough to span 
the distance between receptor and DUB. Based on our structure, both 
Rpn13 and the UIM of Rpn10 are located 70-80 A from the predicted 
position of the Rpnll MPN domain (Fig. 3b). The shuttle receptors 
Rad23, Ddil and Dsk2 are expected to reside ~80-120 A away from 
Rpn11, depending on where they bind Rpn1 (ref. 13). For receptor 
interaction, at least part of the ubiquitin chain has to be in an extended 
conformation with the hydrophobic patches exposed******. Because a 
single ubiquitin moiety in an extended K48-linked chain contributes 
approximately 30A in length*®, it would take three ubiquitins to 
span the distance between Rpnl0 or Rpn13 and Rpnll. Moreover, 
both RpnlO and Rpn13 bind between two consecutive ubiquitin 
moieties****, such that at least a tetra-ubiquitin chain would be 
required on a substrate to allow interaction with a receptor and 
simultaneous deubiquitination by Rpn11 (Fig. 6). This model agrees 
with in-vitro studies that indicate a minimum of four K48-linked 
ubiquitins is necessary for efficient substrate degradation’, although 
this number may differ for other chain types”’. Given the arrangement 
of Rpnl0 and Rpn13, an ubiquitin chain would have to be signifi- 
cantly longer to interact with both receptors. However, knockout 
studies have shown that ubiquitin chains are not required to bind 
to multiple receptors simultaneously™. 

In contrast to Rpn11, Ubp6 is known to cleave within polyubiquitin 
chains or trim them from their distal end*. Of all the ubiquitin- 
interacting subunits in the regulatory particle, we found Ubp6 to be 
the furthest away from the entrance to the pore, which may allow it to 
clip extended or unnecessary ubiquitin chains from substrates. 
Because Ubpé6 is located closer to Rad23, Dsk2 or Ddil than to 
Rpnl0 or Rpn13, it may act preferentially on substrates delivered 
by these shuttle receptors. 

To avoid dissociation upon deubiquitination, a substrate polypeptide 
must be engaged with the unfolding machinery of the base before or 


Figure 6 | Model for the recognition, deubiquitination and engagement of a 
polyubiquitinated substrate by the 26S proteasome. A K48-linked tetra- 
ubiquitin chain (magenta, PDB ID: 2KDE) is conjugated to the unstructured 
initiation region of a substrate (red) and bound to the ubiquitin receptor Rpn13 
(orange). The substrate is poised for deubiquitination by Rpn11 (green, active 
site indicated by star), and its unstructured initiation region is engaged by the 
translocation machinery of the base (cyan). A polyubiquitin chain could 
alternatively bind to the UIM of Rpn10 (yellow) or interact with both receptors 
simultaneously. The DUB Ubp6 is localized further from the central pore, in a 
position to trim excess ubiquitin chains. 
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shortly after removal of its ubiquitin chain. Engagement by the base is 
known to depend on an unstructured initiation site or “tail” on the 
substrate”, which needs to be long enough to reach through the narrow 
N-ring and into the AAA+ pore (Fig. 6). In addition, this tail would 
have to be sufficiently spaced from the attachment point of the poly- 
ubiquitin chain to allow concurrent substrate engagement by the pore 
and deubiquitination by Rpn11. The distance between the predicted 
active site of Rpn11 and the AAA+ pore below the N-ring is approxi- 
mately 60 A, which could easily be bridged by 40-45 unstructured 
residues or a shorter tail combined with a folded structure. 

As an alternative to the above model for simultaneous receptor bind- 
ing and deubiquitination, it has been proposed that commencing sub- 
strate translocation by the base might move the proximal ubiquitin from 
a receptor towards Rpn11 for cleavage’. Our structure suggests for this 
model that efficient substrate processing would only require a mono- or 
diubiquitin for receptor binding and a 50-60 A longer spacing between 
the ubiquitin and the flexible tail to reach the AAA+ pore. This length 
dependence of engagement is consistent with recent in vitro degrada- 
tion studies, using model substrates with different lengths and 
ubiquitin modifications’. Future experiments will be required to assess 
whether substrates get deubiquitinated in a translocation-dependent or 
-independent manner. 


Concluding remarks 


The work presented here defines the architecture of the entire 
proteasome regulatory particle and provides a much-needed struc- 
tural framework for the mechanistic understanding of ubiquitin- 
dependent protein degradation. We localized Rpn11 directly above 
the entrance of the pore, surrounded by the ubiquitin receptors Rpn10 
and Rpn13. This insight allows us to visualize the substrate’s path 
towards degradation and will be critical in elucidating how the char- 
acteristics of ubiquitin modifications affect substrate recognition and 
processing. Moreover, our study significantly furthers the under- 
standing of the heterohexameric AAA+ motor of the proteasome. 
Individual ATPase subunits were found in a spiral staircase arrange- 
ment and may operate with more limited dynamics than previously 
assumed for AAA+ protein unfoldases. 

Unexpectedly, the lid is bound to the side of the holoenzyme and 
interacts with both the base and core particle. These interactions 
induce major conformational changes in lid subunits that may 
allosterically activate the DUB Rpn11, allowing critical removal of 
ubiquitin chains during substrate degradation in the holoenzyme, 
while preventing futile deubiquitination by the isolated lid. In addi- 
tion, contacts between the subcomplexes could have unexplored roles 
in coordinating individual substrate processing steps, for instance 
ubiquitin binding, deubiquitination, and the onset of translocation. 
The intricate architecture of the proteasome highlights the complex 
requirements for this proteolytic machine, which must accommodate 
and specifically regulate a highly diverse set of substrates in the 
eukaryotic cell. 


METHODS SUMMARY 

Protein expression and purification. Endogenous holoenzyme, core particle*! 
and lid subcomplex” were purified from S. cerevisiae essentially as described. The 
base subcomplex was purified according to protocols for the holoenzyme pre- 
paration, but with minor modifications as described in the Methods. Details of 
yeast strain construction are provided in Supplementary Table 1. 

Yeast lid was recombinantly expressed from three plasmids in E. coli BL21-star 

(DE3), and purified on anti-Flag M2 resin and by size-exclusion chromatography 
(see Methods). 
Electron microscopy and image analysis. All electron microscopy data were 
collected using the Leginon data collection software’ and processed in the 
Appion electron microscopy processing environment’. Three-dimensional 
maps were calculated using libraries from the EMAN2 and SPARX software 
packages**“°, UCSF Chimera was used for volume segmentation, atomic coord- 
inate docking and figure generation”. 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Recombinant lid construction and purification. Yeast Rpn5, Rpn6, Rpn8, Rpn9 
and Rpn11-6XHis were cloned into pETDuet-1 (Novagen), yeast Rpn3, Flag- 
Rpn7 and Rpn12 were cloned into pCOLA Duet-1 (Novagen), and yeast Sem1 and 
Hsp90 were cloned into pACYCDuet-1 (Novagen). A T7 promoter preceded each 
gene and each plasmid contained a T7 terminator following the multiple cloning 
site. Genes for select rare transfer RNAs were included in the pACYCDuet-1 
plasmid to account for codon-usage differences between yeast and E. coli. To 
ensure full-length of Rpn6 in lid particles used for biochemical experiments and 
the negative stain reconstruction of recombinant lid, we used a construct with the 
Flag tag moved from Rpn7 to Rpné. E. coli BL21-star (DE3) cells were co- 
transformed with the three plasmids mentioned above. Lid proteins and the 
chaperone Hsp90 were coexpressed overnight at 18 °C after inducing cells with 
0.5 mM isopropyl-B-p-thiogalactopyranoside at D¢oo = 0.7. Cells were collected 
by centrifugation (4,000g for 30 min), resuspended in Flag buffer (50 mM HEPES, 
pH7.6, 100 mM NaCl, 100 mM KCl and 5% glycerol) supplemented with protease 
inhibitors and 2 mg ml lysozyme, and sonicated on ice for 2 min in 15-s bursts. 
The lysate was clarified by centrifugation (27,000g for 30 min), and the complex 
was affinity-purified on anti-Flag M2 resin (Sigma-Aldrich) using an N-terminal 
Flag-tag on Rpn6 or Rpn7. The protein was concentrated in a 30,000 MWCO 
concentrator (Amicon) for further purification on a Superose 6 size-exclusion 
column (GE Healthcare) equilibrated in Flag buffer. Intact, assembled lid particles 
eluted at 13.1 ml, similar to lid purified from yeast. 

Hisg-tagged yeast Rpn10 was expressed in E. coli and purified by Ni-NTA 

affinity and size-exclusion chromatography. 
Yeast strain construction. Wild-type holoenzyme was purified from the strain 
YYS40 (MATa ade2-1 his3-11,15 leu2-3,112 trp1-1 ura3-1 can1 RPN11::RPN1I- 
3XFLAG (HIS3))**. To generate RPN10, RPN13 and UBP6 deletion strains, the 
kanMX6 sequence was integrated at the respective genomic locus, replacing the 
gene in YYS40. To generate the strains used to purify GST-Rpn2, GFP-Rpn5 and 
GFP-Rpn8 holoenzyme, sequences encoding the respective tags under the control 
of the Pgari promoter were integrated 5’ of the respective genes in YYS40. To 
generate the strain used to purify Rpn1—Flag holoenzyme, a sequence encoding the 
Flag-tag was integrated 3' to RPN1 in aW303 background strain (MATa ade2-1 
his3-11 leu2-3,112 trp1-1 ura3-1 can1-100 barl). 

To generate the strains used to purify «2 mutant-containing core particle for 
the crosslinking experiments shown in Supplementary Fig. 11, pRS305 (LEU2) 
containing the mutant «2 and the genomic sequences found 500 nucleotides 
upstream and 100 nucleotides downstream of the gene was integrated at the 
LEU2 locus of RJD1144 (MATa, his3A200 leu2-3,112 lys2-801 trpA63 ura3-52 
PRE1-FLAG-6xHIS::Ylplac211 (URA3))"', and the chromosomal copy of «2 was 
deleted. To generate the strain used to purify lid with Rpn6 tagged with three 
haemagglutinin (HA) for crosslinking, the 3x HA sequence was integrated 3' of 
RPN6 in YYS40. 

Expression and purification of yeast holoenzyme and subcomplexes. 
Endogenous holoenzyme, core particle” and lid subcomplex” were purified from 
S. cerevisiae essentially as described. Frozen yeast cells were lysed in a Spex 
SamplePrep 6870 Freezer/Mill. For holoenzyme purification, lysed cells of a strain 
containing a Flag-tag on Rpnll were resuspended in lysis buffer containing 
60mM HEPES pH7.6, 50mM NaCl, 50mM KCI, 5mM MgCl, 0.5mM 
EDTA, 10% glycerol, 0.2% NP-40, and ATP regeneration mix (5mM ATP, 
0.03 mg ml~' creatine kinase, 16mM creatine phosphate). Holoenzyme was 
bound to anti-Flag M2 resin and washed with wash buffer (60 mM HEPES 
pH7.6, 50mM NaCl, 50mM KCI, 5mM MgCh, 0.5mM EDTA, 10% glycerol, 
0.1% NP-40 and 500 11M ATP) before elution with 3 x Flag peptide and separation 
over Superose-6 in gel-filtration buffer (60 mM HEPES pH7.6, 50mM NaCl, 
50mM KCl, 5mM MgCh, 0.5mM EDTA, 10% glycerol and 500M ATP). 
Lid, base or core particle were purified similarly but from different yeast strains 
and including a salt wash to separate subcomplexes. Lid was purified from a yeast 
strain containing Rpn11-Flag using a 900 mM NaCl wash. Base was purified from 
a yeast strain containing a C-terminal Flag tag on Rpn2 and including a 500 mM 
NaCl wash, with 500 1M ATP present throughout the purification. Core particle 
was purified from a yeast strain containing a Flag—6 x His tag on Pre] and includ- 
ing a 500mM NaCl wash. All subcomplexes were further purified by size- 
exclusion chromatography on Superose-6 in gel filtration buffer (see above). 
GFP degradation assay. Proteasome holoenzyme was reconstituted from 20S 
core, base, Rpn10 and recombinant or endogenous yeast lid in the presence of 
ATP. A GEFP-titin-cyclin fusion protein was modified with a K48-linked 
polyubiquitin chain” and degraded by reconstituted proteasome at 30°C in 
Flag buffer with an ATP-regeneration system (5 mM ATP, 16 mM creatine phos- 
phate, 6 1gml~' creatine phosphokinase). Degradation was monitored by the 
loss of fluorescence using a QuantaMaster spectrofluorimeter (PTI). 
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Protein crosslinking. Sulfo-MBS (Thermo Scientific) is a short (7.3 A), 
heterobifunctional crosslinker, whose maleimide moiety reacts primarily with 
sulphydryls between pH 6.5 and 7.5, and whose NHS ester reacts with primary 
amines between pH 7 and 9. We purified core particle from yeast strains in which 
the only copy of the core «2 subunit was either wild type, a D245C mutant, or an 
A249C mutant. Other intrinsic cysteines of the core were found largely non- 
reactive towards sulphydryl-modifying agents (not shown). 10 1M reduced core 
particle purified from strains containing wild type, A249C and D245C «2 was 
incubated with 150 11M sulpho-MBS for 15 min at pH 6.5, allowing conjugation of 
the crosslinker to cysteines. Core particle was buffer-exchanged to remove excess 
crosslinker and increase the pH to 7.5, activating the amine-reactive functional 
group on sulpho-MBS. This core particle was added at a final concentration of 
2M to a proteasome reconstitution mixture, containing 2 1M purified base, 
10 uM purified Rpn10, 0.5 mM ATP, and 2 uM lid purified from a yeast strain 
in which Rpn6 was C-terminally tagged with a 3xHA tag. Crosslinking was 
allowed to proceed for 15 min before reactions were stopped by the addition of 
0.5mM glycine pH7.5 and divided equally for separation by SDS-PAGE, fol- 
lowed by either Coomassie staining or anti-HA western blotting. 

Electron microscopy. Sample preparation: negative-stain analysis of both the 
purified proteasome lid and holoenzyme complexes was performed using 400 
mesh continuous carbon grids that had been plasma-cleaned in a 75% argon/25% 
oxygen atmosphere for 20s using a Solarus plasma cleaner (Gatan). Due to the 
tendency for holoenzyme to adopt a preferential orientation on the carbon sub- 
strate, 5ul of 0.1% poly L-lysine hydrobromide (Polysciences catalogue 
no. 09730) was placed onto the hydrophilized carbon grids and adsorbed for 
90s, washed twice with 511 drops of water, and allowed to dry completely. 
This polylysine step was skipped when preparing grids containing the lid samples, 
as the lid does not adopt a preferred orientation on the carbon substrate. The 
remaining steps were identical for both holoenzyme and lid. A 4-11 drop of 
sample at a concentration of 251M was placed onto the grid and allowed to 
adsorb for 1 min. The grid was blotted to near-dryness and a 4-l drop of fresh 
2% (w/v) uranyl formate was quickly placed onto the grid. To reduce the amount 
of glycerol remaining on the grids, they were subsequently floated on four 
successive 25-1] drops of the uranyl formate solution, waiting 10s on each drop. 
The grids were then blotted to dryness. 

Preservation of both lid and holoenzyme complexes in vitreous ice was per- 
formed in the same manner. 400-mesh C-flats containing 21m holes with a 
spacing of 2 um (Protochips) were plasma cleaned in a 75% argon/25% oxygen 
atmosphere for 8 s using a Solarus plasma cleaner (Gatan). The purified sample, at 
a concentration of 5 uM in a buffer containing 5% glycerol, was first diluted 1:5 
from 60 mM HEPES, pH 7.6, 50mM NaCl, 50mM KCl, 5mM MgCh, 0.5mM 
EDTA, 10% glycerol, 1mM DTT, 0.5mM ATP into a buffer containing 20 mM 
HEPES, pH 7.6, 50mM NaCl, 50mM KCl, 1mM ATP, 1mM DTT and 0.05% 
NP40, and 4-11 aliquots were placed onto the grids. Grids were immediately 
loaded into a Vitrobot (FEI company) whose climate chamber had equilibrated 
to 4°C and 100% humidity. The grids were blotted for 3 s at an offset of — 1 mm, 
and plunged into liquid ethane. The frozen grids were transferred to a grid box 
and stored in liquid nitrogen until retrieved for data collection. 

Data collection: negative-stain analysis of the lid and holoenzyme samples was 
performed using a Tecnai T12 Bio- TWIN and a Tecnai F20 TWIN transmission 
electron microscope operating at 120 keV. Lid samples were imaged at a nominal 
magnification of X 68,000 (1.57 A per pixel at the specimen level) on the T12, and 
80,000 (1.45 A per pixel) on the F20. Holoenzyme samples were imaged at a 
magnification of X 49,000 (2.18 A per pixel) on the T12, and X50,000 (2.16 A per 
pixel) on the F20. T12 data were acquired on a F416 CMOS 4Kx4K camera 
(TVIPS), F20 data were acquired on a Gatan 4Kx4K camera, and all micrographs 
were collected using an electron dose of 20e7 A~* with a randomly set focus 
ranging from —0.5 to —1.241m. The automatic rastering application of the 
Leginon data collection software was used for data acquisition. Between 300 
and 500 micrographs were collected for each of the negatively stained data sets. 

For cryoelectron microscopy, individual grids were loaded into a 626 single-tilt 
cryotransfer system (Gatan) and inserted into a Tecnai F20 TWIN transmission 
electron microscope operating at 120keV. Data were acquired at a nominal 
magnification of X100,000 (1.08A per pixel) using an electron dose of 
20e” A~? with a randomly set focus ranging from —1.2 to —2.5 yum. A total of 
9,153 micrographs were collected of the holoenzyme using the MSI-T application 
of the Leginon software. While the holoenzyme was remained intact during the 
freezing process, the isolated lid specimen became completely disassembled dur- 
ing the freezing process. In an attempt to overcome this, the isolated lid was also 
frozen using grids onto which a thin carbon film was floated. Due to the elevated 
background noise from the addition of a carbon substrate, the resulting images 
lacked the sufficient signal-to-noise ratio necessary to solve a cryoelectron 
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microscopy structure of the isolated lid to a better resolution than the negative- 
stain structure. 

Image processing of negative-stain data. All image pre-processing and two- 
dimensional classification was performed in the Appion image processing 
environment“. Due to the large number of data sets acquired for both the nega- 
tively stained lid and holoenzyme complexes, a generalized schema was used for 
image analysis. This schema also minimized user bias during comparison of tagged 
and deletion constructs with their wild-type counterparts. The contrast transfer 
function (CTF) of each micrograph was estimated concurrently with data collec- 
tion using the ACE2 and CTFFind programs”*', providing a quantitative mea- 
surement of the imaging quality. Particle selection was also performed 
automatically concurrent with data collection. Negatively stained lid particles were 
selected from the micrographs using a difference of Gaussians (DoG) transform- 
based automated picker’, and holoenzyme particles were selected using a 
template-based particle picker. Micrograph phases were corrected using ACE2, 
and both lid and holoenzyme particles were extracted using a 288 X 288-pixel box 
size. The data were then binned by a factor of two for processing. Each particle was 
normalized to remove pixels whose values were above or below 4.5 ¢ of the mean 
pixel value using the XMIPP normalization program”. 

To remove aggregation, contamination or other non-particle selections, particle 
stacks were decimated by a factor of 2 and subjected to five rounds of iterative 
multivariate statistical analysis (MSA) and multi-reference alignment (MRA) 
using the IMAGIC software package”. Two-dimensional class averages depicting 
properly assembled complexes were manually selected, and the non-decimated 
particles contributing to these class averages were extracted to create a new stack 
for further processing. To include a larger range of holoenzyme views, particles 
contributing to doubly capped proteasome averages were removed. This stack of 
particles went through five rounds of MSA/MRA in IMAGIC™, and a final cor- 
respondence analysis and classification based on Eigen images using the SPIDER 
software package’’ was performed to generate two-dimensional class averages of 
the complexes. 

Initial models for reconstructions of both the holoenzyme and lid were determined 
using the established “C1 startup” routines in IMAGIC. Two-dimensional class 
averages were manually inspected to select three images representing orthogonal 
views of the complex, which were in turn used to assign Eulers in a stepwise fashion 
to the entire data set of reference-free class averages. The resulting low-resolution 
models of the lid and holoenzyme were low-pass filtered to 60-A resolution, and 
these densities were used as starting points for refinement of the three-dimensional 
structure. 

Three-dimensional reconstructions were all performed using an iterative 
projection-matching and back-projection approach using libraries from the 
EMAN2 and SPARX software packages**“°. Refinement of the starting models 
began using an angular increment of 25°, progressing down to 2° for the lid, and 
1° for the holoenzyme. The refinement only continued to the subsequent angular 
increment once greater than 95% of the particles showed a pixel error of less than 
1 pixel. The resolution was estimated by splitting the particle stack into two 
equally sized data sets, calculating the Fourier shell correlation (FSC) between 
the resulting back-projected volumes. The estimated resolutions for the final 
endogenous and recombinant lid structures based on their FSC curves at 0.5 were 
about 15 A. 


Image processing of cryoelectron microscopy holoenzyme. Processing of the 
holoenzyme cryo data set proceeded in a very similar fashion to that of the 
negatively stained particle data sets. Only ACE2 was used to estimate CTF of 
the images and measure image quality, and particles were extracted using a box 
size of 576 pixels. Reference-free two-dimensional classification was performed to 
remove particles that did not contribute to averages depicting a doubly capped 
proteasome. Three rounds of reference-free two-dimensional classification, and 
particles were removed after each round. From an initial data set of 312,483 
automatically selected particles, 93,679 were kept for the three-dimensional 
reconstruction. C2 symmetry was applied to one of the previously determined 
asymmetric negative-stained reconstructions to serve as a starting model for 
structure refinement. The reconstruction began using an angular increment of 
25°, and iterated down to 0.6°. C2 symmetry was imposed during the reconstruc- 
tion. Low-resolution Fourier amplitudes of the final map were dampened to 
match those of an experimental GroEL SAXS curve using the SPIDER software 
package™. 

The estimated resolution based on the FSC of the half-volumes at 0.5 was 
approximately 9 A, although a local resolution calculation using the “blocres” 
function in the Bsoft package” indicated a range of resolutions within the density. 
The majority of the core particle subunits and the AAA+ ATPases were resolved 
to between 7- and 8-A resolution, whereas the non-ATPase subunits in the 
regulatory particle ranged from 8- to 12-A resolution (Supplementary Fig. 7). 
Notably, Rpn1 and the ubiquitin receptors Rpn10 and Rpn13 were the lowest 
resolution features of the holoenzyme. To filter the low-resolution portions of the 
map properly, without destroying the details of the better-ordered features, a 
resolution-driven adaptive localized low-pass filter was applied to the final 
volume (G. Cardone, personal communication). 

The segmentation analysis was manually performed using the “Volume 
Tracer” tool in the UCSF Chimera visualization software’. This software was 
additionally used to perform all rigid-body fitting of crystal structures into the 
holoenzyme cryoelectron microscopy density, as well as to generate all renderings 
for figure images. 


48. Sone, T., Saeki, Y., Toh-e, A. & Yokosawa, H. Sem1p is a novel subunit of the 26 S 
proteasome from Saccharomyces cerevisiae. J. Biol. Chem. 279, 28807-28816 
(2004). 

49. Kim, H.C. & Huibregtse, J. M. Polyubiquitination by HECT E3s and the 
determinants of chain type specificity. Mol. Cell. Biol. 29, 3307-3318 (2009). 

50. Mallick, S. P., Carragher, B., Potter, C. S. & Kriegman, D. J. ACE: automated CTF 
estimation. Ultramicroscopy 104, 8-29 (2005). 

51. Mindell, J.A.& Grigorieff, N. Accurate determination of local defocus and specimen 
tilt in electron microscopy. J. Struct. Biol. 142, 334-347 (2003). 

52. Voss, N. R., Yoshioka, C. K., Radermacher, M., Potter, C. S. & Carragher, B. DoG 
Picker and TiltPicker: software tools to facilitate particle selection in single particle 
electron microscopy. J. Struct. Biol. 166, 205-213 (2009). 

53. Sorzano, C. 0. et al. XMIPP: a new generation of an open-source image processing 
package for electron microscopy. J. Struct. Biol. 148, 194-204 (2004). 

54. van Heel, M., Harauz, G., Orlova, E. V., Schmidt, R. & Schatz, M. A new generation of 
the IMAGIC image processing system. J. Struct. Biol. 116, 17-24 (1996). 

55. Frank, J. et al. SPIDER and WEB: processing and visualization of images in 3D 
electron microscopy and related fields. J. Struct. Biol. 116, 190-199 (1996). 

56. Heymann, J. B. & Belnap, D. M. Bsoft: image processing and molecular modeling 
for electron microscopy. J. Struct. Biol. 157, 3-18 (2007). 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature10777 


A PGCl-a-dependent myokine that 
drives brown-fat-like development of 
white fat and thermogenesis 


Pontus Bostrém!, Jun Wu, Mark P. Jedrychowski?, Anisha Kordel, Li Ye!, James C. Lo‘, Kyle A. Rasbach!, 
Elisabeth Almer Bostrém®, Jang Hyun Choi’, Jonathan Z. Long’, Shingo Kajimura‘*, Maria Cristina Zingaretti’, Birgitte F. Vind®, 
Hua Tu’, Saverio Cinti®, Kurt Hojlund®, Steven P. Gygi* & Bruce M. Spiegelman! 


Exercise benefits a variety of organ systems in mammals, and some of the best-recognized effects of exercise on muscle 
are mediated by the transcriptional co-activator PPAR-y co-activator-1 « (PGCl-a). Here we show in mouse that 
PGCl-a expression in muscle stimulates an increase in expression of FNDC5, a membrane protein that is cleaved and 
secreted as a newly identified hormone, irisin. Irisin acts on white adipose cells in culture and in vivo to stimulate UCP1 
expression and a broad program of brown-fat-like development. Irisin is induced with exercise in mice and humans, and 
mildly increased irisin levels in the blood cause an increase in energy expenditure in mice with no changes in movement 
or food intake. This results in improvements in obesity and glucose homeostasis. Irisin could be therapeutic for human 
metabolic disease and other disorders that are improved with exercise. 


PGC1-z is a transcriptional co-activator that mediates many biological 
programs related to energy metabolism. Originally described as a co- 
activator of PPAR-y that modulated expression of uncoupling protein 1 
(UCP1) and thermogenesis in brown fat’, it has also been shown to 
control mitochondrial biogenesis and oxidative metabolism in many 
cell types. PGC1-o is induced in muscle by exercise and stimulates 
many of the best-known beneficial effects of exercise in muscle: 
mitochondrial biogenesis, angiogenesis and fibre-type switching”. It 
also provides resistance to muscular dystrophy and denervation-linked 
muscular atrophy’. The health benefits of elevated muscle expression of 
PGC1-% may go beyond the muscle tissue itself. Transgenic mice with 
mildly elevated muscle PGC1-« are resistant to age-related obesity and 
diabetes and have a prolonged lifespan*. This suggests that PGC1-« 
stimulates the secretion of factors from skeletal muscle that affect the 
function of other tissues. Here we show that PGC1-« stimulates the 
expression of several muscle gene products that are potentially secreted, 
including FNDCS5. The Fndc5 gene encodes a type I membrane protein 
that is processed proteolytically to form a newly identified hormone 
secreted into the blood, termed irisin. Irisin is induced in exercise and 
activates profound changes in the subcutaneous adipose tissue, stimu- 
lating browning and UCP1 expression. Importantly, this causes a sig- 
nificant increase in total body energy expenditure and resistance to 
obesity-linked insulin resistance. Thus, irisin action recapitulates some 
of the most important benefits of exercise and muscle activity. 


Muscle PGCl-a transgenics 

Mice with transgenically increased PGC1-« in muscle are resistant to 
age-related obesity and diabetes’, suggesting that these animals have a 
fundamental alteration in systemic energy balance. We therefore ana- 
lysed the adipose tissue of the PGC1-« transgenic mice for expression of 
genes related to a thermogenic gene program and genes characteristic 


of brown fat development. There were no significant alterations in the 
expression of brown-fat-selective genes in the interscapular brown 
adipose tissue or in the visceral (epididymal) white adipose tissue 
(Fig. la). However, the subcutaneous fat layer (inguinal), a white 
adipose tissue that is particularly prone to ‘browning’ (that is, forma- 
tion of multilocular, UCP1-positive adipocytes), had significantly 
increased levels of Ucp1 and Cidea messenger RNAs (Fig. 1b). We also 
observed increased UCP1 protein levels and more UCP1-positive 
stained multilocular cells in transgenic mice compared to controls 
(Fig. 1c, d). There are recent reports that exercise causes a mild increase 
in the expression ofa thermogenic gene program in the visceral adipose 
tissue, a depot that has minimal expression of these genes’. As it is the 
subcutaneous white adipose depot that has the greatest tendency to 
turn on a thermogenic gene program and alter the systemic energy 
balance of mice’, we re-investigated this with regard to browning of the 
white adipose tissues in two types of exercise. Similar to what has been 
reported’, a twofold increase in Ucp1 mRNA expression was observed 
in the visceral, epididymal fat with 3 weeks of wheel running (Sup- 
plementary Fig. 1). However, a much larger change (approximately 25 
fold) was seen in the same mice in the subcutaeneous inguinal fat 
depot. Similarly, a small increase in Ucp] mRNA expression was seen 
in the epididymal fat with repeated bouts of swimming in warm (32 °C) 
water (Supplementary Fig. 1); however a very large increase (65 fold) 
was observed in the inguinal white depot (Supplementary Fig. 1). 
Thus, muscle-specific expression of PGCla drives browning of sub- 
cutaneous white adipose tissue, possibly recapitulating part of the 
exercise program. 


Media from PGCl-a-expressing myocytes 


The effect on browning of the adipose tissues from PGC1-a-expressing 
muscle could be due to direct muscle-fat signalling or to another, 
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Figure 1 | Muscle-specific PGC1-a transgenic mice have increased brown/ 
beige fat cells in the subcutaneous depot. a, b, Quantitative polymerase chain 
reaction (qPCR) against brown fat and thermogenic genes in epididymal fat, 
BAT (a) and subcutaneous, inguinal (b) fat depots in muscle creatine kinase 
promoter (MCK)-PGC1-« transgenic mice or littermate controls. n = 7 for 
each group, repeated in a separate cohort with similar results. c, Representative 
immunohistochemistry against UCP1 in the inguinal depot from indicated 
mice. d, Western blot against UCP1 in the inguinal fat depot (n = 3 and 
repeated in an independent cohort with similar results). e, qPCR against 
indicated genes in adipocytes differentiated for 6 days from stromo vascular 
fraction (SVF) cells. This was done in the presence of conditioned media from 
primary myocytes with forced expression of GFP or PGC1-o (representative for 
three independent experiments). Data are presented as mean + s.e.m., and 
*P <0.05 compared to control group. Student’s t-test was used for single 
comparisons. NS, not significant. 


indirect mechanism. To investigate this, we treated cultured primary 
subcutaneous adipocytes with serum-free media conditioned by myo- 
cytes expressing PGC1-z or cells expressing green fluorescent protein 
(GFP). As shown in Fig. le, the media from cells expressing ectopic 
PGC1-a increased the mRNA levels of several brown-fat-specific genes 
(Fig. le). This suggested that PGC1-« causes the muscle cells to secrete 
a molecule(s) that can induce a thermogenic gene program in the cells. 


Candidate, secreted PGCl-a-dependent proteins 

We used a combination of Affymetrix-based gene expression arrays 
and an algorithm that predicts protein secretion to search for proteins 
that could mediate the browning of adipose tissues under the control 
of muscle PGC1-« (Methods). Proteins with mitochondrial targeting 
sequences were excluded, and all candidates were validated in gain-of- 
function systems for PGC1-« in vivo (Methods). Five proteins were 
identified as PGCl-« target genes in muscle and as likely to be 
secreted: IL-15, FNDC5, VEGF-B, LRG1 and TIMP4 (Fig. 2a). 
Conversely, expression of these genes was reduced in mice with 
muscle-specific deletion of PGC1-« (Supplementary Fig. 2). Furthermore, 
they were also found to be increased at the mRNA level in muscle from 
exercised mice (Fig. 2b). The expression of this same set of genes was 
also examined in muscle biopsies from human subjects before and after 
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Figure 2 | FNDCS5 is induced with forced PGC1-a expression or exercise, 
and turns on brown/beige fat gene expression. a, qPCR against indicated 
genes in skeletal muscle from MCK-PGC1-o transgenic mice or littermate 
controls (n = 7 from each group). b, qPCR against indicated genes in skeletal 
muscle from sedentary mice or mice given 3 weeks of free wheel running 

(n = 10 from each group). Mice were rested for 12 h before being killed. 

c, mRNA expression levels from human muscle biopsies before and after 10 
weeks of endurance exercise training (8 subjects included). All data points are 
normalized to baseline levels. d, SVF from the inguinal fat depot, differentiated 
into adipocytes for 6 days in the presence of saline or recombinant FNDC5 
(20 nM), IL-15 (10 pM) or VEGF- (50 tM). The graph show normalized 
mRNA levels of indicated genes. This experiment was repeated three times with 
similar results. For d, we performed one-way ANOVA tests where *P < 0.05 for 
the effect of FNDC5 on Ucp1 and Cidea expression. All other statistics were 
performed using Student’s t-test, and bar graphs are mean + s.e.m. 


a controlled period of endurance exercise’ (Fig. 2c). FNDC5, VEGFB 
and TIMP4 mRNAs were all significantly induced in humans with 
exercise. IL-15 has previously been reported as being secreted from 
muscle under the influence of exercise®, while the regulation of 
FNDC5, VEGF-8, LRG1 and TIMP4 by exercise has not been 
described. FNDC5 mRNA and brown fat markers in subcutaneous 
fat were not regulated by acute exercise, and FNDC5 mRNA was not 
induced by exposure to cold (4 °C) for 6h (Supplementary Fig. 2). 


Fndc5 induces a browning in vitro 


Several of the proteins encoded by these genes were commercially 
available, so they were applied directly to primary subcutaneous white 
adipocytes during differentiation. Factors such as IL-15 or VEGF-B 
had minimal effects on the expression of Ucp1 and other brown fat 
genes at concentrations of 200 nM or higher. However, FNDC5 pro- 
moted a sevenfold induction of Ucp1 mRNA at a concentration of 
20 nM (Fig. 2d). The transcriptional changes induced by FNDC5 were 
addressed on a global scale using gene expression arrays (Supplemen- 
tary Fig. 3). Notably, Ucp1 and three other known brown fat genes, 
Elovl3, Cox7al and Otop1, were among the eight most upregulated 
genes (Supplementary Fig. 4). Conversely, genes characteristic of 
white fat development were downregulated, such as leptin (Sup- 
plementary Fig. 3). These data indicate that the activation of browning 
and thermogenic genes by FNDCS is a major part of the action of this 
polypeptide on these cells. 

The effects of FNDC5 treatment were remarkably robust; Ucp1 
mRNA was increased 7-500 fold in more than ten experiments using 
FNDC5 at a concentration of 20 nM (Fig. 3a and Supplementary Fig. 5). 
Moreover, we could demonstrate a clear dose-dependence, with an 
effective range between 20-200 ng ml‘ (1.5-15 nM) (Fig. 3b). In con- 
trast, BMP7, reported as a potent inducer of browning’, had a much 
smaller effect (maximum of twofold) on the same cells at 3.3 uM 
(Fig. 3a). 
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Figure 3 | FNDCS5 is a potent inducer of the brown/beige fat gene program 
a, SVF from the inguinal fat depot was differentiated into adipocytes for 6 days 
in the presence of saline, recombinant FNDC5 (20 nM) or BMP7 (3.3 1M). The 
graph show normalized mRNA levels for indicated genes. Similar results were 
obtained in more than 10 experiments with the fold induction of Ucp1 between 
7-500 fold. b, mRNA levels of Ucp1 from inguinal-derived SVF treated with 
FNDCS5 for 6 days at indicated doses. c, Clark electrode measurements of 
oxygen consumption in SVF from the inguinal fat depot, differentiated into 
adipocytes for 6 days in the presence of saline or recombinant FNDC5 (20 nM). 
Data are representative for three independent experiments and normalized to 
total cellular protein. d, qPCR of Ucp1 mRNA from SVF, differentiated into 
adipocytes, and treated with FNDC5 or saline for 6 days followed by addition of 
forskolin for 8 h. +P < 0.05 compared to forskolin treatment. e, qPCR of Ppara 
after 6 days of FNDCS5 treatment (20 nM) during differentiation of primary 
SVF. f, SVF differentiated into adipocytes and treated with FNDC5 and/or 
GW6471 for 6 days. The graph shows qPCR of indicated genes. +P < 0.05 
compared to FNDCS treatment. For d and f, combined one- and two-way 
ANOVA was used. *P < 0.05. All other statistics were performed using 
Student’s t-test, and bar graphs are mean + s.e.m. 


We also used immunohistochemistry to study cells treated with 
FNDCS and observed a robust increase in UCP1-positive adipocytes 
with multilocular lipid droplets (Supplementary Fig. 4). Electron 
microscopic analysis of FNDC5-treated cells showed a higher density 
of mitochondria compared to control cells, consistent with a brown- 
fat-like phenotype and elevated mitochondrial gene expression 
(Supplementary Fig. 5). The sizes of mitochondria, however, were 
similar between groups (Supplementary Fig. 4). Lastly, measurements 
of oxygen consumption provided functional evidence of increased 
energy expenditure with FNDC5 exposure. Total oxygen consump- 
tion was greatly increased (100%) by 20nM of FNDCS5, and the 
majority of this respiration was uncoupled (Fig. 3c). Thus, FNDC5 
potently induces thermogenesis and a brown-fat-like gene program in 
cultured adipocytes. In marked contrast, FNDC5 showed little or no 
effects on the classical brown fat cells isolated from the interscapular 
depot (Supplementary Fig. 4). 

We sought to define the timeframe during the differentiation pro- 
cess when FNDC5 was effective. FNDC5 was applied to cells in 2-day 
windows from day 0-6, and this was compared to cells to which the 
protein was added during the entire 6-day differentiation process. As 
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shown in Supplementary Fig. 5, treatment during days 3-6 was effec- 
tive at inducing Ucp1 mRNA, although not as effective as when FNDC5 
was present throughout the differentiation process. Furthermore, 
treatment during the initial 2 days had no effect on UCP1 levels, sug- 
gesting that FNDC5 acts mainly during the differentiation process of 
cells committed to the adipocyte lineage. Cyclic AMP (cAMP) is an 
important signalling pathway in thermogenesis, promoting the brown- 
fat gene program downstream of f-adrenergic stimulation. We 
therefore asked whether FNDCS5 effects were additive or redundant 
with cAMP signalling. As shown in Fig. 3d, FNDC5-exposed cells 
increase UCP1 expression in an additive manner when exposed to 
forskolin, an adenyl cyclase activator. Two-way ANOVA tests demon- 
strated that there was a significant (P< 0.01) interaction between 
FNDCS5 and forskolin treatment, indicating synergistic effects. 


PPAR-a acts downstream of Fndc5 


A key question is how FNDCS is able to stimulate a thermogenic gene 
program. One potentially important transcription factor induced by 
FNDCS5, identified using gene expression arrays, was PPAR-«. This 
nuclear receptor has been shown to drive Ucp1 expression and several 
other genes involved in browning of adipose cells'®. Ppara is increased 
threefold at the mRNA level by FNDCS treatment (Fig. 3e). Importantly, 
the FNDC5-mediated increase in UCP1 was significantly reduced when 
cells were simultaneously subjected to the selective PPAR-« antagonist 
GW6471 (Fig. 3f). The functional interaction between the FNDC5 and 
GW6471 treatments on UCP] expression was confirmed using two-way 
ANOVA (P < 0.05). Conversely, the PPAR-« antagonist normalized the 
reduction seen in white adipose genes leptin and adiponectin after 
FNDC5 treatment. Together, these data indicate that FNDC5 acts to 
induce Ucp1 gene expression, at least in part, via PPAR-o. 


Irisin is a cleaved and secreted fragment of Fndc5 


FNDCS (also known as FRCP2 and PeP), was previously shown to 
have a signal peptide, two fibronectin domains and one hydrophobic 
domain that is likely to be membrane inserted’? (Fig. 4a). Previous 
studies did not investigate whether part of this protein might be 
secreted''*, With this structure in mind, we considered the possibility 
that FNDC5 might be synthesized as a type I membrane protein, 
followed by proteolytic cleavage and release of the amino (N)- 
terminal part of the protein into the extracellular space. Thus, any 
carboxy (C)-terminal or N-terminal tags would be lost during proces- 
sing of the mature protein or interfere with localization. Indeed, 
expression of a C-terminally Flag-tagged FNDC5 (Fig. 4a) did not 
result in any Flag immunoreactivity in the medium from cells expres- 
sing this construct (Fig. 4b). However, when we immunoblotted the 
same samples with an antibody that recognizes the endogenous 
FNDC5 protein, we could easily detect substantial amounts of 
FNDCS in the media at approximately 32 kilodalton (kDa): this is 
slightly larger than the cellular FNDC5 (Fig. 4b). These data indicate 
that FNDC5 is C-terminally cleaved, secreted and possibly further 
modified. 

Western blot of media fractions with antibodies against wild-type 
FNDCS5 showed multiple bands, suggestive of glycosylation, a common 
feature of secreted proteins. Treatment of supernatants from FNDC5- 
expressing cells with peptide N-glycosidase F (PNGase F) resulted in a 
significant size decrease as detected by SDS—polyacrylamide gel elec- 
trophoresis (SDS-PAGE), from 32 kDa to 20 kDa (Fig. 4c). 

Mass spectrometry was used to determine the sequence of the 
FNDC5-derived polypeptide found in the media (Methods). To do this, 
we fused the N terminus of FNDC5 (without the signal peptide) to the C 
terminus of the crystallisable fragment (Fc) domain of immunoglobulin 
G (IgG). After purification and enzymatic deglycosylation of the 
secreted material, mass spectrometry analyses indicated that secreted 
FNDC5 was truncated at glutamic acid 112 (not including the signal 
sequence), as shown in Fig. 4e. The secreted portion of FNDC5 has 
remarkable conservation between species, with 100% identity between 
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Figure 4 | FNDCS5 is proteolytically cleaved and secreted from cells. 
a, Schematic representation of the FNDC5 protein structure (top), Flag-tagged 
FNDCS protein (middle) and irisin (bottom). C, C-terminal domain; 
H, hydrophobic domain; SP, signal peptide. b, HEK 293 cells transfected with a 
vector expressing the C-terminal Flag-tagged FNDC5 (CTE-F5, third panel 
from a), followed by isolation of cell and culture media protein. Samples were 
adjusted for protein content and western blots were performed against the Flag 
antigen (left) or FNDC5 (right). This was repeated in several experiments with 
similar results. Adjusting for volume (instead of protein content) also gave 
similar results. C, cell fraction; M, media fraction. Arrows indicate molecular 
weight. c, HEK 293 cells transfected with a vector expressing FNDC5-CTF, 
followed by isolation of cell and media protein. Respective protein fraction was 
then treated with PNGase F followed by western blot against FNDC5 after 
SDS-PAGE. d, Representation of the full-length FNDC5 and the irisin 
fragment mapped by mass spectrometry (underlined). 


mice and humans (Supplementary Fig. 6). Because this distinct, 
secreted polypeptide has not been previously described and signals 
from muscle to other tissues we named it irisin, after Iris, the Greek 
messenger goddess. 

The ability of the anti-FNDC5 antibodies to react with irisin allowed 
us to investigate the contribution made by irisin to the browning 
activity caused by muscle cells expressing PGC1-«. Media conditioned 
by muscle cells that had forced expression of PGC1-o were incubated 
with control or anti-FNDC5 antibodies before they were applied to the 
fat cell cultures. As shown in Supplementary Fig. 6, the FNDC5 
antibody caused a marked reduction in the ability of the PGCl-o 
conditioned media to induce Ucp1 and Cidea mRNA in the primary 
inguinal cells. This suggests that irisin accounts for a significant frac- 
tion of this activity found in secreted media from muscle cells with 
forced PGC1-a expression. We cannot, however, exclude the possibility 
that other factors might also contribute to this response. 


Irisin is present in mouse and human plasma 


We next analysed levels of irisin in plasma from wild-type mice, using 
intravenous adenoviral delivery of full-length FNDCS5 as a positive 
control. This method results in strong forced expression from the liver 
and potential secretion to the plasma, where we detected irisin using 
western blot after albumin/IgG-depletion and deglycosylation. As 
seen in Fig. 5a, FNDC5-expressing virus resulted in a clear increase 
in an immunoreactive band at 22 kDa. Importantly, this was the only 
band altered on these blots. Using western blots of purified FNDC5 
protein as a quantitative standard, irisin is present in the plasma of 
control mice at a concentration of approximately 40nM. We also 
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Figure 5 | Detection of irisin in mouse and human plasma. a, Plasma from 
mice injected intravenously with adenoviral vectors (Ad) expressing FNDCS5 or 
GFP was subjected to western blot against FNDCS. b, Western blot against 
irisin in plasma from muscle-specific PGC1-«% knockout (MKO) mice or 
control littermates (flox/flox). WT, wild type. c, Western blot against irisin in 
plasma from wild-type mice or two healthy human subjects (representative for 
8 subjects analysed identically). d, Western blot against irisin in serum from 
control or 3 weeks exercised mice, followed by 12h rest. Bottom panel shows 
quantification of the bands. e, Western blot analysis of irisin in plasma from 
human subjects before and after 10 weeks of endurance exercise. Eight subjects 
in total were analysed; quantification after internal normalization is displayed 
in bottom panel. For all plasma analyses, samples were depleted for albumin/ 
IgG, and deglycosylated using PNGase F. Arrow indicated irisin band. Data are 
presented as mean = s.e.m., and *P < 0.05 compared to control group. 
Student’s t-test was used for single comparisons. 


analysed plasma of PGC1- muscle-specific knockout mice as a nega- 
tive control, and the irisin band at 22 kDa was decreased by 72% in 
these animals (Fig. 5b). Furthermore, an immunoreactive band of 
identical electrophoretic mobility was found in plasma from healthy 
human subjects (Fig. 5c). This band was greatly diminished when the 
anti-FNDCS5 antibody was neutralized with an excess of antigen 
(Supplementary Fig. 7). 

We examined blood levels of irisin after exercise in mice and 
human subjects. Mice had significantly elevated (65%) plasma con- 
centrations of irisin after they were subjected to 3 weeks of free wheel 
running (Fig. 5d). Similar analyses in healthy adult humans subjected 
to supervised endurance exercise training for 10 weeks revealed a 
twofold increase in the circulating irisin levels compared to the 
non-exercised state (Fig. 5e). Thus, irisin is present in mouse and 
human plasma, and is increased with exercise. The increase in cir- 
culating protein in both species is roughly proportional to the 
increases observed at the mRNA level in muscle (Fig. 2c). 


Irisin reduces obesity and insulin resistance 

We used adenoviral vectors to express full-length FNDC5 (ora control 
GFP) and examined its biological and therapeutic effects. This method 
resulted in a 15-fold increase in liver Fndc5 mRNA, although the liver 
expresses very low endogenous levels of this mRNA. Plasma levels of 
irisin were increased 3-4 fold (Fig. 5a). The mice did not display any 
adverse reaction, and upon gross pathological examination, there was 
no apparent toxicity in any major organ system. There was also no 
increase in plasma AST levels, and inflammatory genes were not sig- 
nificantly altered in the liver when the two groups were compared 
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(Supplementary Fig. 8). Ten days after injection, Ucp1 mRNA was 
increased by 13-fold in the subcutaneous depot relative to the same 
depot in mice receiving the virus expressing GFP (Fig. 6a, b); Cidea was 
also significantly upregulated (Fig. 6a). There were no changes in 
expression of UCP1 in the interscapular classical brown fat (BAT), 
but we did observe a minor elevation in Prdm16 and Pgcla mRNA 
(Supplementary Fig. 8). The changes in gene expression in the sub- 
cutaneous adipose tissues were accompanied by a clear increase in the 
number of UCP1-positive, multilocular adipocytes (Fig. 6c). We did 
not, however, detect any change in body weight in the GFP versus 
FNDC5 groups of animals. We observed similar results in young 
C57BL/6 mice (Supplementary Fig. 9). Thus, moderate increases in 
circulating irisin can induce browning of white adipose tissues in vivo, 
including increased expression of UCP1. 

As activation of the classical brown fat or browning of white fat has 
been shown to improve obesity and glucose homeostasis in vivo’, we 
delivered FNDC5-expressing adenovirus to mice rendered obese and 
insulin-resistance by feeding a high fat diet. We chose C57BL/6 mice 
for these experiments because they are highly prone to diet-induced 
obesity and diabetes. The expression of irisin increased Ucp1 gene 
expression to the same degree as in lean mice (Supplementary Fig. 9). 
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Figure 6 | Irisin induces browning of white adipose tissues in vivo and 
protects against diet-induced obesity and diabetes. a—c, Wild-type BALB/c 
mice were injected with 10'° GEP- or FNDC5-expressing adenoviral particles 
intravenously (n = 7 for each group). a, b, Animals were killed after 10 days and 
inguinal/subcutaneous fat pads were collected and analysed using qPCR 
analysis of indicated mRNAs (a) and western blot against UCP1 

(b). c, Representative images from immunohistochemistry against UCP1 in 
these mice. All results in a-c were repeated two times with similar results. 
d-g, C57BL/6 mice fed a 60% kcal high-fat diet for 20 weeks were intravenously 
injected with GFP- or FNDC5-expressing adenovirus and all analyses were 
done 10 days thereafter (n = 7 for both groups). d, Oxygen consumption at day 
and night. e, Body weights of mice 10 days after injection with indicated 
adenovirus. f, Fasting plasma insulin measured using enzyme-linked 
immunosorbent assay (ELISA). g, Intraperitoneal glucose tolerance test. 

h, Mice were injected intraperitoneally with 50 tg of rabbit IgG or a rabbit anti- 
FNDC5 antibody (ab) and were either subjected to swimming for 7 days or kept 
sedentary (n = 10 for all groups). Data show mRNA expression levels from 
inguinal white adipose tissue. All data in d—j were performed at least twice in a 
separate mouse cohort with similar results. +P < 0.05 compared to exercise and 
IgG. One-way ANOVA was used for statistics in h. All other statistics were 
performed using Student’s t-test, and bar graphs are mean ~ s.e.m. 
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There was also an elevation in expression of several mitochondrial 
genes (Supplementary Fig. 8). Notably, these changes occurred with 
only moderately increased irisin blood levels, threefold compared to the 
GFP-expressing mice. This effect was accompanied with a large 
increase in oxygen consumption (Fig. 6d), consistent with the gene 
expression data, and body weights of the irisin-expressing mice were 
slightly reduced after 10 days compared to GFP-expressing controls 
(Fig. 6e). These effects of irisin on mitochondrial gene expression in the 
fat were not seen in skeletal muscle in vivo or in cultured myocytes 
(Supplementary Fig. 10). Irisin expression in the mice fed a high fat diet 
caused a significant improvement in glucose tolerance when compared 
to the control mice expressing GFP. In addition, fasting insulin was also 
reduced (Fig. 6f, g). These data illustrate that even moderately increased 
levels of circulating irisin potently increase energy expenditure, reduce 
body weight and improve diet-induced insulin resistance. 

Finally, we asked whether irisin is required for the exercise-induced 
effects on the subcutaneous white fat. Injection of anti-FNDC5 
antibodies into mice before 10 days of swim training dramatically 
reduced the effect of this exercise on Ucp1 and Cidea gene expression, 
compared to injection of control antibodies (Fig. 6h). In contrast, 
Prdm16 mRNA levels were not increased with exercise and were also 
not affected by the anti-FNDC5 antibodies. Thus, irisin is required for 
a substantial part of the effect of exercise on these gene expression 
events in the browning of white fat. 


Discussion 


Exercise has the capacity to improve metabolic status in obesity and type 
2 diabetes, but the mechanisms are poorly understood. Importantly, 
exercise increases whole body energy expenditure beyond the calories 
used in the actual work performed"*. However, the relative contribution 
of the adipose tissues to this phenomenon has not been clarified. 
Because transgenic mice expressing PGCl-« selectively in muscle 
showed a remarkable resistance to age-related obesity and diabetes’, 
we sought factors secreted from muscle under the control of this co- 
activator that might increase whole body energy expenditure. We 
describe a new polypeptide hormone, irisin, which is regulated by 
PGC1-z, secreted from muscle into blood and activates thermogenic 
function in adipose tissues. Irisin is remarkable in several respects. First, 
it has powerful effects on the browning of certain white adipose tissues, 
both in culture and in vivo. Nanomolar levels of this protein increase 
UCP1 in cultures of primary white fat cells by 50 fold or more, resulting 
in increased respiration. Perhaps more remarkable, viral delivery of 
irisin that causes only a moderate increase (~3 fold) in circulating levels 
stimulates a 10-20 fold increase in UCP1, increased energy expenditure 
and an improvement in the glucose tolerance of mice fed a high fat diet. 
As this is in the range of increases seen with exercise in mouse and man, 
it is likely that irisin is responsible for at least some of the beneficial 
effects of exercise on the browning of adipose tissues and increases in 
energy expenditure. It is important to note that the evidence provided 
here does not exclude a role for other tissues besides muscle in the 
biological regulation and secretion of irisin. 

Second, the cleaved and secreted portion of FNDC5, the hormone 
irisin, is highly conserved in all mammalian species sequenced. Mouse 
and human irisin are 100% identical, compared to 85% identity for 
insulin, 90% for glucagon and 83% identity for leptin. This certainly 
implies a highly conserved function that is likely to be mediated by a 
cell surface receptor. The identity of such a receptor is not yet known. 

On the basis of the gene structure of Fndc5, with a signal peptide that 
was evidently missed in previous studies’’, we considered that FNDC5 
might be a secreted protein. Indeed, we have observed that the signal 
peptide is removed, and the mature protein is further proteolytically 
cleaved and glycosylated, to release the 112-amino-acid polypeptide 
irisin. The cleavage and secretion of irisin is similar to the release/ 
shedding of other transmembrane polypeptide hormones and hormone- 
like molecules such as epidermal growth factor (EGF) and transforming 
growth factor-% (TGF-«). 
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As the conservation of calories would probably provide an overall 
survival advantage for mammals, it seems paradoxical that exercise 
would stimulate the secretion of a polypeptide hormone that increases 
thermogenesis and energy expenditure. One explanation for increased 
irisin expression with exercise in mouse and man may be that it evolved 
as a consequence of muscle contraction during shivering. Muscle 
secretion of a hormone that activates adipose thermogenesis during 
this process might provide a broader, more robust defence against 
hypothermia. 

The therapeutic potential of irisin is obvious. Exogenously adminis- 
tered irisin induces the browning of subcutaneous fat and thermogenesis, 
and it presumably could be prepared and delivered as an injectable 
polypeptide. Increased formation of brown or beige/brite fat has been 
shown to have anti-obesity, antidiabetic effects in multiple murine 
models’, and adult humans have significant deposits of UCP 1-positive 
brown fat’. Data presented here show that even relatively short treat- 
ments of obese mice with irisin improves glucose homeostasis and 
causes a small weight loss. Whether longer treatments with irisin 
and/or higher doses would cause more weight loss remains to be deter- 
mined. The worldwide, explosive increase in obesity and diabetes 
renders attractive the therapeutic potential of irisin in these and related 
disorders. 

Another potentially important aspect of this work relates to other 
beneficial effects of exercise, especially in some diseases for which no 
effective treatments exist. The clinical data linking exercise with 
health benefits in many other diseases suggests that irisin could also 
have significant effects in these disorders. 


METHODS SUMMARY 

Primary mouse stromal vascular fractions from adipose tissues were differentiated 
as described®. FNDC5/irisin were purchased from ABNOVA (GST fused), or 
produced from Syd Laboratories (GST fused) or LakePharma (Fc fusions). 
Hydrodynamic injections’®, electron microscopy’’ and treadmil running'* were 
performed as previously described. Unless otherwise stated, bar graph data are 
presented as mean + s.e.m., and *P < 0.05 compared to control group. Student’s 
t-test was used for single comparisons and one-way ANOVA for multiple. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Materials. Antibodies against UCP1, tubulin and FNDC5 were from Abcam. 
Forskolin, insulin, dexamethasone, rosliglitazone, GW6471 and antibody against 
Flag were from Sigma. Primers for all qPCR experiments are listed in Supplemen- 
tary Table 1. Recombinant FNDC5, LRGI, IL-15, VEGF-B and TIMP4 were from 
ABNOVA (Taiwan). Coomassie staining kit and Lipofectamine 2000 were from 
Invitrogen. 

Identification of PGC1-a-dependent secreted proteins. All PGC1-0-induced 
genes as judged from gene expression analysis in MCK-PGC1-o muscle with a 
fold change of at least 2 and P < 0.05 were subjected to the following analysis. The 
protein sequence of the longest transcript were analysed in the SignalP-software”” 
(http://www.cbs.dtu.dk/services/SignalP/). Sequences with positive S, C, Y and D 
scores were considered positive for a signal sequence. All positive proteins were 
then screened for mitochondrial target sequences (http://www.cbs.dtu.dk/ 
services/TargetP/) whereas positive sequences were removed. All remaining hit 
proteins were then analysed using qPCR in muscle from MCK-PGC1-« mice and 
myocytes overexpressing PGC1-a. 

Primary cell cultures and recombinant protein treatments. The SVF from 
inguinal fat depots of 8-12-week-old BALB/C mice were prepared and differen- 
tiated for 6 days as previously described”’. Rosiglitazone was used for the two first 
days of differentiation. For all experiments, unless otherwise indicated, recom- 
binant FNDCS was added to the culture media at a concentration of 1 tg ml’ for 
the last 4 days of differentiation. Primary myoblasts were cultured and differen- 
tiated as described previously”. 

Preparation of protein fractions from cells and media. 293HEK or primary 
myocytes were transfected by standard methods or transduced with adenovirus at 
a multiplicity of infection (m.o.i.) of 20 as indicated. Twenty-four hours after 
transfection, media was removed, and cells were washed in large volumes of PBS 
five times, followed by incubation in Freestyle serum-free media (GIBCO) for 
24h. The cells and media were then collected separately, and media were cen- 
trifuged three times at 2,000g to pellet debris. Thereafter, a quarter volume of ice- 
cold TCA was added and precipitated protein was pelleted at 6,000g and washed 
three times in acetone. Pellet was then dried and resuspended in SDS-containing 
lysis buffer. Protein concentration was measured in both cell and media fraction 
and adjusted either by protein or volume as indicated. 

RT-PCR. qPCR was carried out after Trizol-based RNA extraction using 
RNAeasy (Invitrogen) and thereafter SYBR green. All data were normalized to 
TBP, 18S or indicated in-house genes and quantitative measures were obtained 
using the AAC; method. 

Western blot and quantification. Protein amounts from all samples were 
assessed using the BCA-kit (Thermo Scientific) followed by protein concentration 
normalization before all western blot experiments. Western blot was carried out 
following standard procedure and final band intensity (QL-BG) was quantified 
using BioPix iQ”. All data were normalized to background and loading controls. 
Additional methods. CLARK electrode measurements, energy expenditure in 
vivo, interaperitoneal glucose tolerance test (IGTT) and immunohistochemistry 
against UCP1 were performed as described previously®, with the exception that 
CLARK output was normalized to total cell protein. Fc-fusion construction and 
protein purification was performed by LakePharma. 

Comprehensive laboratory animal monitoring system. C57/B16J mice were fed 
a high fat (60% kcal) diet (D12492, Research Diets) for 20 weeks, starting at 6 
weeks of age. Mice were then injected with indicated doses of adenovirus expres- 
sing GFP or FNDCS, and comprehensive laboratory animal monitoring system 
(CLAMS; Columbia Instruments)-cage analysis was performed as described 
previously®. Briefly, mice were acclimated in metabolic chambers for 2 days 
before analysis in order to minimize stress. CO and O, levels were then collected 
every 36 min for a period of 3 days. Data on activity, heat generation and food 
intake were measured at more frequent intervals. Circadian rhythm was observed 
for most parameters. Data were not normalized to body weights unless otherwise 
stated. 

Mass spectrometry and peptide fingerprinting. Gel bands were digested with 
sequencing grade trypsin (Promega) or ASP-N (Sigma-Aldrich) as per manufac- 
turer’s instructions. Extracted in-gel protein digests were resuspended in 8 115% 
formic acid/5% acetonitrile, and 4 ul were analysed by microcapillary liquid 
chromatography electrospray ionization tandem mass spectrometry (LC-MS/ 
MS). Analyses were done on a LTQ Orbitrap Velos mass spectrometer 
(Thermo Fisher Scientific) equipped with a Thermo Fisher Scientific nanospray 
source, an Agilent 1100 Series binary HPLC pump and a Famos autosampler. 
Peptides were separated on a 100m X 16cm fused silica microcapillary column 
with an in-house made needle tip. The column was packed with MagicC18AQ 
Cig reversed-phase resin (particle size, 541m; pore size, 200 A; Michrom 
Bioresources). Separation was achieved through applying a 30 min gradient from 
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0-28% acetonitrile in 0.125% formic acid. The mass spectrometer was operated in 
a data-dependent mode essentially as described previously~* with a full MS scan 
acquired with the Orbitrap, followed by up to ten LTQ MS/MS spectra on the 
most abundant ions detected in the MS scan. Mass spectrometer settings were: full 
MS (automated gain control, 1 X 10°; resolution, 6 X 104; m/z range, 375-1,500; 
maximum ion time, 1,000 ms); MS/MS (AGC, 5 x 10°; maximum ion time, 
120 ms; minimum signal threshold, 4 x 10°; isolation width, 2 Da; dynamic 
exclusion time setting, 30s). After MS data acquisition, RAW files were converted 
into mzXML format and processed using a suite of software tools developed in- 
house for analysis. All precursors selected for MS/MS fragmentation were 
confirmed using algorithms to detect and correct errors in monoisotopic peak 
assignment and refine precursor ion mass measurements. All MS/MS spectra were 
then exported as individual DTA files and searched with no enzyme using the 
Sequest algorithm. These spectra were then searched against a database containing 
sequence of mouse FNDCS in both forward and reversed orientations. The fol- 
lowing parameters were selected to identify FNDC5: 10 p.p.m. precursor mass 
tolerance, 0.8 Da product ion mass tolerance, fully tryptic or ASP-N digestion, 
and up to two missed cleavages. Variable modifications were set for methionine 
(+15.994915). In addition, a fixed modification for the carbamidomethylation for 
cysteine (+57.021464) was used as well. The C-terminal fragment for FNDC5 was 
identified (KDEVTMKE) by trypsin digestion and reconfirmed by a separate 
ASP-N digestion. 

Preparation of plasma samples for western blot. Thirty-five microlitres of 
mouse or human plasma were precleared for albumin/IgG using the 
ProteoExtract-kit (CalBiochem) as recommended by the manufacturer. 
Samples were then concentrated to approximately 100 ul and >8 pg pl’, fol- 
lowed by deglycosylation of 150 ug using PNGase F (New England Biolabs). 
Totally, 80 ul were then prepared containing 1X sample buffer with reducing 
agent and 1.7 1g yl ' protein, sonicated, boiled and analysed using western blot 
against FNDC5 or indicated antibody. 

Construction of adenoviral vectors. The FNDC5 expression vector was pur- 
chased with a C-terminal Flag-tag from OriGene. The QuickChange Multi Site 
XL Directed Mutagenesis Kit (Aligent Technologies) was used to introduce a Flag 
tag downstream of the signal sequence and to mutate the C-terminal Flag tag, thus 
resulting in the N-terminal Flag (NTF)-FNDCS5 construct (Fig. 5a). The NTF and 
CTF FNDC5 constructs were then subcloned into the pENTRla vector 
(Invitrogen) and recombined into the pAd-CMV-DEST-V5 vector (Invitrogen) 
and adenovirus was produced using the virapower system (Invitrogen), including 
three rounds of amplification. Thereafter, virus was concentrated using the 
Vivapure adenopack 100 (Sartorius Stedim Biotech) and buffer exchanges to 
saline reaching a concentration of 9-10i.f.u. yl’. A GEP-containing adenovirus 
previously used was prepared in parallel. 

Transgenic mice. The MCK-PGCl-o transgenic and muscle-specific PGC1-« 
knockout mice have been described previously’. 

Exercise protocols. Twelve-week-old B6 mice were exercised either using 
swimming” or using free wheel running, as described previously”®. 

Human material and exercise training program. Blood samples and skeletal 
muscle biopsies were obtained from eight male non-diabetic individuals before 
and after 10 weeks of aerobic training as described previously’. In brief, the 
exercise-training program consisted of cycling on stationary bikes with 4—5 sessions 
of 20-35 min per week at an average intensity of ~65% of maximal oxygen con- 
sumption. Informed consent was obtained from all volunteers before participation. 
The study was approved by the Local Ethics Committee and was performed in 
accordance with the Helsinki Declaration. 
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Understanding the determinants of healthy mental ageing is a 
priority for society today’”. So far, we know that intelligence dif- 
ferences show high stability from childhood to old age** and there 
are estimates of the genetic contribution to intelligence at different 
ages”*®. However, attempts to discover whether genetic causes con- 
tribute to differences in cognitive ageing have been relatively un- 
informative’ '°. Here we provide an estimate of the genetic and 
environmental contributions to stability and change in intelligence 
across most of the human lifetime. We used genome-wide single 
nucleotide polymorphism (SNP) data from 1,940 unrelated indi- 
viduals whose intelligence was measured in childhood (age 11 
years) and again in old age (age 65, 70 or 79 years)'’”. We use a 
statistical method that allows genetic (co)variance to be estimated 
from SNP data on unrelated individuals’*"'’. We estimate that 
causal genetic variants in linkage disequilibrium with common 
SNPs account for 0.24 of the variation in cognitive ability change 
from childhood to old age. Using bivariate analysis, we estimate a 
genetic correlation between intelligence at age 11 years and in old 
age of 0.62. These estimates, derived from rarely available data on 
lifetime cognitive measures, warrant the search for genetic causes 
of cognitive stability and change. 

General cognitive ability (also known as general intelligence, or g’*) 
is an important human trait. It shows consistent and strong associa- 
tions with important life outcomes such as educational and occu- 
pational success, social mobility, health, illness and survival!®, 
Maintaining good general cognitive ability in old age is associated with 
better physical health and the ability to carry out everyday tasks’’”®. 
Intelligence differences are highly heritable from adolescence, and 
through adulthood to old age*®. Long-term follow-up studies have 
shown that about half of the phenotypic variance in general intelligence 
in old age is accounted for by its measure in childhood**. The corollary 
of this is that there are systematic changes through the life course in the 
rank order of intelligence between people; that is, some people’s 
intelligence ages better than others. The determinants of stability 
and change in intelligence across the human life course are being 
sought, and candidate determinants include a wide range of genetic 
and environmental factors’*”’°?!”?. There have been longitudinal 
studies within childhood/adolescence, middle adulthood and old age, 
but none that stretches from childhood to old age with the same indi- 
viduals (to our knowledge). Until now, the proportion of the variance 
in lifetime cognitive stability and change explained by genetic and 
environmental causes has been almost unknown. Apart from a small 
contribution from variation in the APOE gene, suggested individual 
genetic contributions to stability and change in intelligence across the 


life course are largely unreplicated”’. Therefore, an important novel 
contribution would be to partition the covariance between intelligence 
scores at either end of the human life course into genetic and environ- 
mental causes. To address this, the present study applies a new ana- 
lytical method’*""” to genome-wide association data from human 
participants with general cognitive ability test scores in childhood 
and again in old age. 

Participants were members of the Aberdeen Birth Cohort 1936 
(ABC1936) and the Lothian Birth Cohorts of 1921 and 1936 
(LBC1921, LBC1936)'"'*"”. They are community-dwelling, surviving 
members of the Scottish Mental Surveys of 1932 (the 1921-born indi- 
viduals) and 1947 (the 1936-born individuals), in which they took a 
well-validated test of general intelligence (Moray House Test) at a 
mean age of 11 years. They were traced and re-tested again in old 
age on a large number of medical and psychosocial factors for studies 
of healthy mental and physical ageing. Here, we use cognitive ability 
test data from childhood and from the first occasion of testing in old 
age for each subject. For all three cohorts, cognitive ability in old age 
was measured using the first unrotated principal component from a 
number of diverse cognitive tests. Additionally, the LBC1921 and 
LBC1936 cohorts re-took the Moray House Test in old age. Thus, 
the present study partitions into genetic and environmental causes 
the variance in stability and change in general intelligence over a 
period of between 54 and 68 years. Testing for 599,011 SNPs was 
performed on the Illumina610-Quadv1 chip (Illumina); the genotyp- 
ing of the samples in this study was described previously’” and quality 
control is described in Methods Summary. 

To estimate additive genetic and environmental contributions to 
variation in cognitive ageing we used genotype information from 
536,295 genome-wide autosomal SNPs. The method used here is a 
multivariate extension of our recently developed method, which allows 
the estimation of distant relationships between conventionally un- 
related individuals from the SNP data and correlates genome-wide 
SNP similarity with phenotypic similarity'*’’. A detailed description 
of the overall approach and statistical methods is given in Supplemen- 
tary Fig. 1 and the Supplementary Note. We used a linear mixed model 
to estimate variance components. The methodology for the estimation 
of genetic variation from population samples was described previously 
and has been applied to continuous traits, including height, body- 
mass index and cognitive ability'’*’*"”, and to disease’. The method 
is analogous to a pedigree analysis, with the important difference that 
we estimate distant relatedness from SNP markers. Because the rela- 
tionships are estimated from common SNP markers, phenotypic vari- 
ance explained by such estimated relationships is due to linkage 
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disequilibrium between the genotyped markers and unknown causal 
variants'*'**!, The method estimates genetic variation from SNPs that 
are in linkage disequilibrium with unknown causal variants, and so 
provides a lower limit of the total narrow sense heritability because 
additive variation due to variants that are not in linkage disequilibrium 
with the genotyped SNPs is not captured. 

We first performed a univariate analysis of cognitive ageing (Sup- 
plementary Note), which we had defined previously as intelligence 
scores in old age phenotypically adjusted for intelligence at childhood, 
by fitting the Moray House Test of intelligence at age 11 as a linear 
covariate™*, We estimated that 0.24 (standard error 0.20) of phenotypic 
variance in cognitive ageing was accounted for by the SNP-based 
similarity matrix. We next conducted a bivariate genetic analysis of 
intelligence scores early and later in life, to partition the observed 
phenotypic covariance in intelligence measured in childhood and 
old age into genetic and environmental sources of variation. Informa- 
tion on the environmental correlation comes from the comparison of 
the two phenotypes within individuals whereas the genetic correlation 
is inferred from between-individual comparisons of the two pheno- 
types (Supplementary Note). That is, the analysis can inform us about 
genetic and environmental contributions to stability and change in 
intelligence across the life course. The phenotypic correlation between 
Moray House Test intelligence at age 11 and the general intelligence 
component in old age was 0.63 (standard error 0.02) (Table 1). The 
bivariate analysis resulted in estimates of the proportion of phenotypic 
variation explained by all SNPs for cognition, as follows: 0.48 (standard 
error 0.18) at age 11; and 0.28 (standard error 0.18) at age 65, 70 or 79 
(referred to hereafter as 65-79). The genetic correlation between these 
two traits was 0.62 (standard error 0.22), and the environmental cor- 
relation was 0.65 (standard error 0.12). From the results of the bivariate 
analyses we can make a prediction about the proportion of phenotypic 
variance explained by the SNPs for cognition at 65-79 years given the 
phenotype at age 11 years. This provided a prediction of 0.21 (standard 
error 0.20), which is consistent with the actual estimate of 0.24 
(standard error 0.20) from the univariate analysis (Supplementary 
Table 1), suggesting that the bivariate normal distribution assumption 
underlying the bivariate analysis is reasonable. Hence, the results from 
the bivariate analysis contain the full description of the genetic and 
environmental relationships between cognition at childhood, cog- 
nition at old age, and cognitive change. We re-ran this model with 
different cut-offs for relatedness (Supplementary Table 2). The esti- 
mates are very similar but with, as expected, larger standard errors for 
more stringent cut-offs, which result in a smaller sample size. This 
shows that the results are not driven by unusually high correlations 
for a few close relatives. 

In the present analyses we did not adopt the usual procedure of 
dividing the parameter estimates by the standard errors to obtain test 
statistics and accompanying P values, because the standard errors were 


Table 1 | Bivariate analysis of intelligence at age 11 and at age 65-79 


Using general intelligence 
component in old age 


Using Moray House 
Test in old age 


Estimate Standard error* Estimate Standard error* 
hy? 0.478 0.177 0.298 0.229 
ho? 0.280 O.177 0.289 0.221 
Ic 0.623 0.218 0.798 0.266 
le 0.652 0.125 0.630 0.132 
Ip 0.627 0.015 0.680 0.014 


Where h,? and ho” are variance explained by all SNPs for intelligence at age 11 and old age, 
respectively; rg is genetic correlation; r. is residual correlation; rp is phenotypic correlation. A total of 
1,940 unrelated individuals were included with the general intelligence component phenotype data at 
childhood (1,830) or old age (1,839) (1,729 individuals had both phenotypes). Of the 1,515 LBC1921 
and LBC1936 individuals, there were 1,391 with genetic information and Moray House Test scores both 
at age 11 and in old age. 

*The standard errors are estimated from a first-order Taylor series expansion about the estimated 
maximum likelihood values and may be biased downwards°*. For testing hypotheses we have used the 
likelihood-ratio test statistic, which is more accurate. 
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derived froma first-order Taylor series of the logarithm of the likelihood 
about the parameter estimates” and these can be biased for modest 
sample sizes. A more appropriate procedure is to use the likelihood- 
ratio test statistic to test the hypotheses that the genetic correlation 
coefficient is zero (no genetic correlation) or 1 (perfect genetic correla- 
tion). When using a likelihood-ratio test, the estimated genetic correla- 
tion coefficient of 0.62 has a borderline significant difference from zero 
(likelihood-ratio test statistic = 2.56, P = 0.055, one-sided test) (Sup- 
plementary Fig. 2), and does not differ significantly from 1. This was 
tested by fitting a repeatability model (which implies a genetic correla- 
tion of 1.0 and the same heritability of repeat observations) that has 
three fewer parameters than the full bivariate model. It resulted in a 
very similar value of the maximum log-likelihood value; the likelihood- 
ratio test statistic was 5.6 (P = 0.133, 3 degrees of freedom) (Sup- 
plementary Table 3). 

LBC1921 and LBC1936 had the same Moray House Test administered 
at age 11 and again in old age. The bivariate analyses were repeated, 
therefore, using the same test of intelligence in childhood and oldage in 
this subsample of the cohorts. The phenotypic correlation between 
Moray House Test intelligence at age 11 and in old age was 0.68 
(standard error 0.01) (Table 1). The bivariate analysis resulted in 
estimates of the proportion of phenotypic variation explained by all 
SNPs for the Moray House Test, as follows: 0.30 (standard error 0.23) 
at age 11; and 0.29 (standard error 0.22) at age 70-79. The genetic 
correlation between these two traits was 0.80 (standard error 0.27). 
When using a likelihood-ratio test, the estimated genetic correlation 
coefficient of 0.80 is not significantly different from zero (likelihood- 
ratio test statistic = 1.51, P=0.11). The environmental correlation 
between these two traits was 0.63 (standard error 0.13). From the 
results of the bivariate analyses we can make a prediction of the pro- 
portion of phenotypic variance explained by the SNPs for the Moray 
House Test at 70-79 years conditional on the phenotype at age 
11 years. This results in an estimate of 0.074 (standard error 0.24) (Sup- 
plementary Table 4). Although the standard errors of the estimates are 
larger because a smaller data set was used, the results are similar to 
those using the full data and it appears that the choice of phenotype at 
old age (Moray House Test or a linear combination of a number of 
tests) has not led to a bias in inference. The estimates suggest that 
cognition early and late in life are similar traits, with possibly some 
genetic variation for cognitive change. 

Using population-based genetic analyses, we have quantified, for the 
first time, the genetic and environmental contribution to stability and 
change in intelligence differences for most of the human lifespan. 
Genetic factors seem to contribute much to the stability of intelligence 
differences across the majority of the human lifespan. We provide a 
lower limit of the narrow sense heritability of lifetime cognitive ageing. 
The point estimate using a general cognitive ability component in old 
age is 0.24, albeit with a large standard error (0.20). We describe the 
estimate as a lower limit because the methods used in the present study 
allow us only to estimate the proportion of the genetic variation con- 
tributing to cognitive ageing that is captured by genetic variants in 
linkage disequilibrium with common SNPs; this will be lower than the 
total narrow sense heritability. We do not have a good estimate of the 
total amount of additive genetic variation for cognitive ageing, and so 
we cannot easily quantify any heritability that is missing from our 
estimate. Some of the possible genetic contribution we have found to 
cognitive change might be attributable to developmental change 
between age 11 and young adulthood. However, the large phenotypic 
correlation between age 11 and old-age intelligence, and the fact that 
heritability estimates of general intelligence by age 11 are at about adult 
levels”, lead us to posit that most of the genetic variation we have found 
is a contribution to ageing-related cognitive changes. The estimate of 
the genetic contribution to lifetime cognitive change was lower when, 
for a subsample, the same test was used in childhood and old age. 

The bivariate analysis conducted here quantifies how differences 
in intelligence early and late in life are attributable to environmental 
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or genetic factors. A genetic correlation of zero would imply that 
intelligence early and late in life are entirely separate traits genetically, 
and that variation in the change in intelligence from childhood to old 
age is partly genetic and a function of the heritability of intelligence 
early and late in life. At the other extreme, a genetic correlation of one 
implies that the two traits have the same genetic determinants, so that 
any variation in the change in intelligence between the two stages in life 
is purely environmental. At conventional levels of significance we 
could not rule out either a genetic correlation of zero or one; however, 
our estimates suggest that genetics and environment could each con- 
tribute substantially to the covariance between intelligence at age 11 
and old age, and that genetic factors might have a role in cognitive 
change between the two stages of the life course. 

The samples studied here comprise the birth cohorts’ survivors, 
those healthy enough to take part in the studies, and people with less 
cognitive decline. Therefore, we considered whether our estimate of 
genetic variation at older ages may be biased downwards because of 
censoring. From life tables officially published by the Scottish 
Government based on census data, we estimate that the individuals 
in our oldest sample who were born in 1921 and alive at age 11 are 
among the ~50% that were still alive at the time of sample collection. 
We know that lower childhood cognitive ability per se is associated 
with premature mortality”, which, of course, our analyses adjust for, as 
specified in the models. However, because there is a paucity of data 
about genetic influences on lifetime cognitive change, we have limited 
information with regard to how these might affect life expectancy. The 
only way to know across the lifespan would have been if all children 
(that is, the ones who survived to older ages—whom we know about— 
and the ones who did not) had been genotyped in 1947. For non- 
normative (that is, pathological) cognitive change, there are genetic 
risk factors associated with younger-onset Alzheimer’s disease that 
result in premature mortality, but such strongly heritable disease is 
rare and the genes do not seem to affect normative cognitive ageing in 
those aged 70 years and over’. Hence, this is not a concern with regard 
to our analyses. APOE ¢4 is a well-known risk factor for non-normative 
cognitive decline, but any differential effect on survival occurs later in 
life, and is thus unlikely to have resulted in attrition in our cohort. 
Moreover, APOE is in Hardy-Weinberg equilibrium in even our oldest 
samples”, supporting this inference. Other known genetic risk factors 
for Alzheimer’s disease have a very small effect on the risk of disease’. 
Hence, a priori, we have nothing to suggest anything but a largely 
neutral effect of genes that influence cognitive ageing on survival. 
However, if there is an effect, the example of cognition’® (by contrast 
with cognitive change) would suggest that this would be negative, 
which would somewhat reduce genetic variation in cognitive change 
across the lifespan among the survivors. 

Until now, studies aimed at finding genetic contributions to cognitive 
ageing have offered little information. They use too-short follow-up 
periods, thereby providing too small an amount of cognitive change”. 
Cognitive assessments tend to be made only within old age, even though 
cognitive ageing occurs from young adulthood onwards. They are 
largely based on behavioural data in twin samples rather than informa- 
tion on DNA variation. The present study is unusual and valuable in 
capturing over half a century of cognitive stability and change and 
examining its causes. The results here provide estimates for the genetic 
and environmental contributions to cognitive stability and change 
across most of the human lifespan. Even with almost 2,000 individuals, 
the study’s power was insufficient to achieve conventional levels of 
significance for the estimates. Our emphasis here has not been on 
the traditional significance thresholds for P values per se, but in trying 
to partition variance in cognitive ability into environmental and 
genetic causes. The phenotypes available here are rare, and so these 
point estimates are useful to guide future research. The present find- 
ings render attractive a search for genetic mechanisms of cognitive 
change across the life course. They also suggest the importance of 
environmental contributions to lifetime cognitive change. 
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METHODS SUMMARY 


Subjects. Recruitment, phenotyping and genotyping of the samples were 
described previously'’’*'’. The mental test at age 11 was a Moray House 
Test''”. In old age, general intelligence was derived using principal components 
analysis of a number of mental tests and saving scores on the first unrotated 
principal component (Supplementary Note). In old age, the assessments of general 
intelligence were made at ages as follows: ABC1936, 64.6 years (standard devi- 
ation 0.9); LBC1936, 69.5 (standard deviation 0.8); LBC1921, 79.1 (standard devi- 
ation 0.6). The LBC1921 and LBC1936 samples, but not the ABC1936, had repeat 
testing of the Moray House Test (already taken at age 11 years) at 79.1 and 69.5 
years, respectively. After applying the genome-wide complex trait analysis 
method'*”’, the distribution of inferred relationships in the samples was as shown 
in Supplementary Fig. 3. We removed one of each pair of individuals whose 
estimated genetic relatedness was >0.2. We retained 1,940 individuals with child- 
hood or old-age phenotype data (1,729 individuals had both): ABC1936, 425; 
LBC1921, 512; and LBC1936, 1,003. Of the 1,515 LBC1921 and LBC1936 indivi- 
duals, there were 1,391 with genetic information and Moray House Test scores at 
age 11 and in old age. 

Genotyping quality control. Quality control procedures were performed per SNP 
and per sample. Individuals were excluded from further analysis if genetic and 
reported gender did not agree. Samples with a call rate = 0.95, and those showing 
evidence of non-European descent by multidimensional scaling, were removed”. 
SNPs were included in the analyses if they met the following conditions: call 
rate = 0.98, minor allele frequency = 0.01, and Hardy-Weinberg equilibrium test 
with P=0.001. After these quality control stages, 1,948 samples remained 
(ABC1936, N = 426; LBC1921, N=517; LBC1936, N= 1,005), and 536,295 
autosomal SNPs were included in the analysis. 
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A novel sensor to map auxin response and 
distribution at high spatio-temporal resolution 


Géraldine Brunoud!, Darren M. Wells”*, Marina Oliva’, Antoine Larrieu?**, Vincent Mirabet', Amy H. Burrow’, Tom Beeckman’®, 
Stefan Kepinski‘*, Jan Traas', Malcolm J. Bennett? & Teva Vernoux! 


Auxin is a key plant morphogenetic signal’ but tools to analyse 
dynamically its distribution and signalling during development 
are still limited. Auxin perception directly triggers the degradation 
of Aux/IAA repressor proteins” *°. Here we describe a novel Aux/ 
IAA-based auxin signalling sensor termed DII-VENUS that was 
engineered in the model plant Arabidopsis thaliana. The VENUS 
fast maturing form of yellow fluorescent protein’ was fused in-frame 
to the Aux/IAA auxin-interaction domain (termed domain II; DII)* 
and expressed under a constitutive promoter. We initially show that 
DII-VENUS abundance is dependent on auxin, its TIR1/AFBs 
co-receptors*°* and proteasome activities. Next, we demonstrate 
that DII-VENUS provides a map of relative auxin distribution at 
cellular resolution in different tissues. DII-VENUS is also rapidly 
degraded in response to auxin and we used it to visualize dynamic 
changes in cellular auxin distribution successfully during two 
developmental responses, the root gravitropic response and lateral 
organ production at the shoot apex. Our results illustrate the value of 
developing response input sensors such as DII-VENUS to provide 
high-resolution spatio-temporal information about hormone distri- 
bution and response during plant growth and development. 
Central to auxin signalling is the ubiquitin- and proteasome- 
dependent degradation of Aux/IAA catalysed by the SCF-type E3 
ubiquitin-ligase complexes SCF'*'/“F®!° (refs 2-6, 8). Aux/IAA 
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Figure 1 | DII-VENUS degradation is dependent on auxin, TIR1/AFBs and 
proteasome activity. a, Schematic representation of auxin signalling. b, Design 
of DII- VENUS; NLS, nuclear localization signal. c, d, DII- VENUS fluorescence 
in absence (c) or presence (d) of 1 1M indole-3-acetic acid. e, Dose-dependent 
degradation of DII- VENUS in wild-type and mutant roots. f, IAA28 domain II 
peptide pull-down assay with Flag—-TIR1 in the presence of indole-3-acetic acid. 


repressors form heterodimers with transcription factors termed auxin 
response factors (ARFs)”'®. Auxin directly promotes the interaction 
between TIR1/AFBs auxin co-receptors and Aux/IAAs’, thus recruit- 
ing Aux/IAAs to the SCF complex”® and derepressing ARF-bound loci. 
This allows the transcription of target genes including most Aux/IAA 
genes, hence providing a negative feedback loop (Fig. 1a)*'®. The most 
widely used tools to monitor auxin distribution in planta are DR5- 
based auxin-inducible reporters whose promoter contains several ARF 
binding sites'’'”. However, as an output of the auxin response pathway 
(Fig. la), reporter activity does not directly relate to endogenous auxin 
abundance but also reflects the contribution of a complex signalling 
pathway’. 

Monitoring the degradation of an Aux/IAA-based green fluorescent 
protein (GFP) reporter would provide a better target for an auxin 
sensor as its signal can be related more directly to hormone abundance 
(Fig. 1a)?*°"*. This has proved very challenging because Aux/IAA half- 
lives are often shorter than GFP maturation time”’*””. To overcome 
this technical limitation, we fused the VENUS fast maturing yellow 
fluorescent protein (YFP)’ to the auxin-interaction domain (termed 
domain II; DII)° from several Aux/IAA proteins and expressed these 
fusion proteins under the constitutive 35S promoter (Fig. 1b and 
Supplementary Fig. 1a, b). Confocal imaging of transgenic root apical 
tissues revealed similar fluorescence patterns but the strongest signal 
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g, h, mDII-VENUS fluorescence in the absence (g) or presence (h) of 1 1M 
indole-3-acetic acid. i, DI-VENUS signal in co-receptor quadruple mutant. 
j, DII-VENUS signal upon 1 11M indole-3-acetic acid and proteasome inhibitor 
(MG132) co-treatment. Green channel, VENUS; red channel, FM4-64. Scale 
bar, 50 um. Error bars, s.d. (WT, = 2; mutant, n = 3). 
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was obtained using domain II of the most stable Aux/IAA used, IAA28 
(Fig. 1c and Supplementary Fig. 1c-f). We thus focused our analyses 
on this form of the sensor, henceforth called DII- VENUS. 

We tested on root tissues the relationship between auxin, its response 
components and DII-VENUS in several independent ways. First, the 
DII-VENUS signal was sensitive in a dose-dependent fashion to exo- 
genous auxin treatment (Fig. 1c-e). Second, the DII peptide interacted 
with its co-receptors TIR1 (Fig. 1f), AFB1 and AFB5 (ref. 18) in an auxin- 
dependent manner. Third, introducing a mutation in the domain II 
sequence of DII- VENUS (mDII-VENUS), that disrupts the interaction 
between Aux/IAA, auxin and the TIR1/AFBs’, reduced the differential 
distribution of fluorescence (Fig. 1g; see below for description of pattern) 
and blocked its auxin-induced degradation (Fig. 1h). Fourth, DII- 
VENUS fluorescence was ubiquitously distributed in roots of the most 
strongly affected tir afb1 afb2 afb3 quadruple mutant’ (Fig. 1i) and the 
mutants were significantly less sensitive to auxin treatment (Fig. le). 
Fifth, disruption of ubiquitin-dependent breakdown of Aux/IAA 
proteins using proteasome inhibitors stabilized DII- VENUS and 
blocked its auxin-induced degradation (Fig. 1) and Supplementary 
Fig. 2). We conclude that DII- VENUS abundance is regulated by auxin 
via its receptors, consistent with the model for Aux/IAA degradation 
(Fig. 1a)’. We also demonstrated that DII- VENUS does not disrupt the 
activity of the auxin response machinery (Supplementary Fig. 3 and 
Supplementary Information). Hence, DII- VENUS directly reports, but 
does not interfere with, the input into the auxin signalling pathway. 

We next took advantage of the simple cellular organization of the 
root apex to quantify the distribution of DI-VENUS fluorescence with 
cellular definition (Fig. 2a, b and Supplementary Fig. 4). Because the 
TIR1/AFB1-3 co-receptor distribution shows only limited variations 
in the root meristem region (except in the root cap; Supplementary 
Fig. 5a-d)’’, this will confer a homogeneous perception capacity. 
Hence, the spatial distribution of DII- VENUS fluorescence is likely 
to represent an inverted auxin distribution map in the root tip. This 
conclusion is further supported by the more homogeneous fluor- 
escence distribution of mDII-VENUS (Fig. 2c and Supplementary 
Fig. 6) and by the complementary patterns of DII- VENUS and 
DRS5::VENUS expression in the quiescent centre, columella and differ- 
entiating xylem cells (Fig. 2d, e)''”°?'. However, mDII- VENUS fluor- 
escence distribution suggests a higher 35S promoter activity in the 
epidermis and cortex in the elongation zone and in the most external 
root cap cells (Fig. 2c and Supplementary Fig. 6). Lower expression of 
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the TIR/AFBs is also expected to confer a lower sensitivity to auxin in 
the root cap (Supplementary Fig. 5a—d). In both cases, this will lead to 
an underestimation of auxin levels by DII- VENUS. Analyses of TIR/ 
AFBs co-receptor distribution and of 35S promoter activity are thus 
essential to interpret the DII- VENUS pattern. 

Even considering these biases (Supplementary Information), DII- 
VENUS quantification indicates that auxin levels are reproducibly 
higher in the first two tiers of columella cells and initials, the quiescent 
centre, the stele initials and early daughters and the differentiating xylem 
cells (Fig. 2b and Supplementary Fig. 4)*”. The other cells in the root 
meristem have lower levels of auxin, with minima observed in the 
epidermis and cortex, but auxin levels significantly increased close to 
the start of the elongation zone. This increase occurs closer to the root tip 
in the epidermis and vasculature compared to the cortex (Fig. 2b). The 
DII-VENUS fluorescence map thus confirms the local maximum of 
auxin at the quiescent centre and in the columella cells'' but also allows 
visualizing the distribution of auxin in the entire root tip. It also reveals a 
previously unsuspected auxin accumulation starting at the transition 
zone between the meristem and the elongation zone. These results are in 
partial agreement with measurements of auxin concentrations in root 
tissues obtained after cell sorting”, the differences being possibly due to 
the higher resolution achieved using DII- VENUS. 

We also detected differential distributions of DII- VENUS fluor- 
escence in the vegetative shoot apical meristem (SAM), in the vascular 
tissues of the hypocotyl (Fig. 2f, g) and later during development in 
the inflorescence SAM and young floral meristems (Supplementary 
Fig. 7a-d)'*. As in the root, reduced differential expression with mDII- 
VENUS, partly complementary DR5:: VENUS patterns and distribution 
of TIR1/AFB1-3 co-receptors (Fig. 2h, i and Supplementary 
Fig. 5e-h) indicate that the distribution of DII-VENUS fluorescence 
is primarily controlled by auxin levels in the shoot apex (Supplemen- 
tary Information)'*. DII-VENUS is therefore able to report relative 
auxin distribution at high spatial resolution in various tissues and 
developmental stages. In addition, in both root and shoot tissues, 
DII-VENUS is degraded not only in cells where DR5 is expressed 
but also in cells that do not express DR5 (Figs 1c, 2a-f and Sup- 
plementary Fig. 7 and Supplementary Information)’*. This observation 
demonstrates that the Aux/IAA-ARF signalling pathway contributes 
significantly to the definition of the DR5 expression pattern. 

To analyse the temporal resolution of the DII- VENUS sensor, we 
compared dynamic changes in DII- VENUS and DR5::VENUS signals 


Figure 2 | DII-VENUS provides a sensor to map 
auxin distribution in plant tissues. a—d, Root 
meristematic tissues. a, b, DII- VENUS fluorescence 
(a) and corresponding map of relative auxin 
distribution (b; top) using the same look-up table. 
EZ, elongation zone. Tissue-specific changes in 
auxin levels along the root axis are shown 

(b; bottom). QC, quiescent centre. c, Fluorescence 
map of mDII-VENUS control. d, DR5:: VENUS. X, 
xylem axis. e, DII- VENUS (blue) and DR5::GFP 
(yellow). f-i, DII- VENUS (f, g), mDH-VENUS 

(h) and DR5:: VENUS (i) in the vegetative shoot apex 
(f, h, i) and hypocotyl (g). M, meristem; L, leaves; V, 
vasculature. Green or blue channel, VENUS; yellow 
channel in e, GFP; red channel, FM4-64 (d, e) or 
autofluorescence (f-i). In f-i the transmission 
channel has been added. Scale bars, 50 um. 
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in roots following exogenous auxin treatment (Fig. 3a, b and Sup- 
plementary Movie 1). Time-lapse confocal imaging revealed that the 
DII-VENUS signal was rapidly lost from all root tissues, whereas the 
signal in untreated roots remained stable (Fig. 3a and Supplementary 


Movie 1). Quantification of VENUS fluorescence in the root tip 


showed that a reduction in the DII-VENUS signal was detected 


minutes after auxin addition and the signal was abolished within 
60 min (Fig. 3b). In contrast, an increase in the DR5::VENUS signal 
was first detected only after 120 min (Fig. 3b). This delay is due to post- 
transcriptional processes, because quantitative reverse transcription- 
PCR (qRT-PCR) detected VENUS messenger RNA minutes after 
auxin treatment (Fig. 3b). We could further show that the DII- 
VENUS degradation kinetics upon auxin treatment is very similar in 
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to exogenous auxin application in various tissues. This observation 
strengthens our conclusion that DII- VENUS fluorescence is directly 
related to auxin levels in both shoot and root meristematic tissues but 
that co-receptor distribution needs to be considered. 

Finally, we used DII- VENUS to follow changes in auxin distribution 
during developmental processes. Roots have been proposed to bend in 
response to gravity by accumulating auxin on the lower side of root 
apical tissues*’***°. Consistent with this model, induction of the DR5 
reporter occurs after 1.5-2 h in the lateral root cap (LRC) and epidermis 
on the lower side of the root (Supplementary Fig. 10)*’. By contrast, 
within 30 min of a 90° gravity stimulus the DII- VENUS signal was 
entirely lost in these tissues on the lower side, whereas fluorescence 


was stable on the upper side (Fig. 4a). A decrease in DII- VENUS fluor- 
escence was also observed in the cortex and endodermis on both sides of 

8). These results indicate similar auxin sensitivity throughout the root the root and to a lesser extent in vascular tissues. Hence, DII- VENUS 

except for a lower sensitivity in the root cap, as already suggested by the 

distribution of the TIR/AFBs (Supplementary Fig. 5a-d). We also 4 

observed that the global dynamics of degradation of DII- VENUS 

was similar in the vegetative and inflorescence SAM, with a minimal 

fluorescence reached after 1 h (Fig. 3c and Supplementary Fig. 9). We 

conclude that DII- VENUS responds almost immediately and similarly 


different root tissues, but is slower in the root cap (Supplementary Fig. 
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Figure 3 | DII-VENUS monitors changes in auxin response and 
distribution at high temporal resolution. a, Time-course of DU-VENUS 
fluorescence following either a mock or an auxin treatment (1 WM 
1-naphthaleneacetic acid; NAA). b, Quantification of DI-VENUS and 
DRS::VENUS fluorescence and of DR5::VENUS mRNA levels in root apices 
treated with 1 uM NAA. c, Time-course of DII- VENUS fluorescence in the 
shoot apex upon 1 LM indole-3-acetic acid treatment; images are projections of 
10 confocal serial sections. Green channel, VENUS. In a the transmission 
channel has been added. Scale bars, 50 tm. 


Figure 4 | DII-VENUS allows visualization of changes in auxin distribution 
during development. a, Changes in DII- VENUS fluorescence during a root 
gravitropic response; g, gravity vector; a line was drawn near equivalent cells in 
the LRC and epidermis on the lower (L) and upper (U) side. b, Auxin build-up 
during organ initiation at the shoot apex visualized using DII- VENUS; arrow, 
site of new organ initiation; asterisks, groups of nuclei showing notable changes 
in fluorescence; F, flower. Yellow or green channel, VENUS; red channel, 
propidium iodide. Scale bars, 50 jim. 
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indicates that the changes in auxin concentration are not restricted to 
the LRC and epidermis during the gravitropic response. We next used 
DII-VENUS to follow auxin-dependent organ initiation at the SAM 
(Fig. 4b)?°?”. DII- VENUS allowed the visualization of the progressive 
build-up of auxin triggering the formation of a new organ (Fig. 4b). It 
also demonstrated significant redistribution of DII-VENUS fluor- 
escence throughout the SAM, most probably reflecting the dynamics 
of auxin transport in the tissue’. Taken together, our observations 
demonstrate that, during both shoot and root development, DII- 
VENUS detects dynamic changes in endogenous auxin distribution 
and responses that are more complex than previously thought. By 
demonstrating that DII- VENUS and DR5 fluorescence patterns are 
only partly complementary, we also provide evidence that the auxin 
signalling pathway has a key role in the spatial control of transcription 
in response to auxin during developmental processes. Finally, several 
other plant hormones have been shown to signal through degradation 
of key signalling regulators***°. Our work provides the foundation for 
building synthetic signalling sensors for different hormones to explore 
their role during development. 


METHODS SUMMARY 

Generation and characterization of DII-VENUS reporter plants. Aux/IAA 
degron sequences'® were fused to VENUS-NLS under the control of the 35S 
promoter (Supplementary Fig. 1) and recombined into gateway binary vector 
pH7m34GW (http://gateway.psb.ugent.be/). Plasmids were transformed into 
Col-0 plants by floral dipping. 

Microscopy, live imaging and chemical treatments. All images were obtained 
using laser-scanning confocal microscope. Tissue organization was visualized 
using FM4-64, propidium iodide, transmission or chlorophyll autofluorescence. 
For live imaging of shoot meristems, plants were grown on the auxin transport 
inhibitor 1-N-naphthylphthalamic acid (NPA) to produce naked apices before 
transferring to a new medium without NPA. To generate maps of relative auxin 
distribution in the root meristem, a cellular grid was generated from a median 
optical section and for each cells fluorescence was extracted from the optical 
section cutting the nucleus at its most median part. 

For analysis of chemically treated roots, 5-day-old DII- VENUS seedlings were 

transferred to media containing the chemicals at the stated concentration. For root 
live imaging, immediately following transfer, the seedlings were scanned every 2 to 
5 min for 2 h. The fluorescence intensity over identical scanned portion of the root 
(corresponding approximately to the first 200 LM from the root tip) was extracted 
at each time point. For tissue-specific kinetics, fluorescence was extracted from 
groups of nuclei with nearly identical fluorescence in the different tissues. For the 
vegetative SAM, 5-day-old DII-VENUS seedlings were mounted in water after 
removing one cotyledon to allow observation of the vegetative shoot apex before 
treating with the stated concentration of auxin. 
Pull-down assays. Pull-down assays were performed using a 35S:Flag-TIR1 line 
and biotinylated IAA28 peptide. The immunodetection of TIR1/AFB-Flag was 
performed with a 1:5,000 dilution of anti-Flag 2-peroxidase (HRP) antibody fol- 
lowed by chemiluminescent detection with ECL plus reagents. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Plant material, growth conditions and plant treatments. All transgenic plants 
were generated in the Columbia ecotype (Col-0). The tir1 afb1 afb2 afb3 quadruple 
mutant, DR5::GFP line and TIR1/AFB1-—AFB3 GUS translational fusions have been 
described*'**'. DR5::VENUS transgenic plants were generated by transforming a 
DRS5::VENUS plasmid” by floral dipping’. Plants were cultivated in vitro on MS 
medium supplemented with 1% sucrose at 22 °C and under long-day conditions 
(16h light/8 h darkness). For analysis on roots, the chemical treatments were done 
on 5-day-old plants by transferring them to liquid MS supplemented with the 
chemicals or on an MS agar supplemented with the chemicals for root live imaging. 
Indole-3-acetic acid (Sigma) or 1-naphthaleneacetic acid (NAA; Sigma) was dis- 
solved in ethanol and used at the indicated concentration. MG132 and clasto- 
lactacystin-f-lactone (lactacystin; Sigma) were dissolved in dimethylsulphoxide 
(DMSO) and used at the final concentration of 50 1M for 2.5h or 20M for 8h 
respectively. For MG132/indole-3-acetic acid co-treatments, plants were pretreated 
with MG132 for 1.5 h before adding indole-3-acetic acid. For analysis on the vegeta- 
tive shoot apex, 5-day-old seedlings were used after removing one cotyledon to allow 
observation of the vegetative shoot apex. Seedlings were mounted in water and 
treatments were done by replacing by capillarity the water with indole-3-acetic acid 
at the indicated concentration. For the inflorescence apex, the plants were trans- 
ferred to water containing indole-3-acetic acid at the indicated concentration. 
Generation of DII-VENUS transgenic plants. The DII- VENUS binary vectors 
were generated using Gateway technology and following the Multisite Gateway 
three-fragment vector construction kit protocol (Invitrogen). To generate the 
different versions of the DII-VENUS sensor (Supplementary Fig. 1), we used the 
region of IAA8, IAA9 and IAA28 starting from the conserved lysine up to the end 
of domain II (IAA8, amino acid positions 107-178; IAA9, 120-195; [AA28, 28-61; 
Supplementary Fig. 1). IAA8, IAA9 and IAA28 were chosen because their basal 
half-lives were potentially long enough’® (ranging from 15-20 min for IAA8 and 
IAA9 to 80 min for [AA28) to allow for the maturation of the fast-maturing YFP 
variant VENUS and thus for visualization of its fluorescence. We cloned the IAA8, 
IAA9 and IAA28 cDNAs by standard RT-PCR from inflorescence mRNA. The 
different wild-type sequences were then amplified by PCR (see Supplementary 
Table 1 for primers) and cloned in pDONR 221 by recombination. We then 
mutated the conserved lysine (K to R mutation) by introducing this mutation in 
the forward primers (sequence in bold replaced by AGA: see Supplementary Table 
1). To generate mDII-VENUS, site-directed mutagenesis (using standard inverted 
PCR procedures) was used to introduce the P53L mutation in the wild-type IAA28 
sequence in pDONR 221 (Supplementary Fig. 1)*’. The sequence of VENUS fused 
to the N7 nuclear localization signal** was amplified by PCR (see Supplementary 
Table 1 for primers) from a VENUS-N7 sequence cloned in pBG36 and cloned into 
PDONR P2R-P3 by recombination. Finally Aux/IAA-derived sequences were 
fused in-frame to VENUS-N7 (Supplementary Fig. 1) and put under the control 
of the strong constitutive CaMV 35S promoter, using a 35S promoter cloned in 
pDONR P4-P1R and recombination into the gateway-compatible pH7m34GW 
binary vector” (hygromycin resistance). The different plasmids were then intro- 
duced in plants by floral dipping”. 

Confocal microscopy, live imaging and quantification of fluorescence. 
Imaging was performed either on either a LSM-510 laser-scanning confocal 
microscope (Zeiss), a SP5 spectral detection confocal microscope (Leica) or an 
Eclipse Ti 2000 laser-scanning confocal microscope (Nikon). For visualization of 
the root organization the roots were stained either with FM4-64 (Invitrogen) as 
previously described” or propidium iodide (Sigma). To quantify fluorescence with 
cellular resolution in DI-VENUS and mDII-VENUS root meristems and generate 
maps of relative auxin using DII- VENUS, serial optical sections were obtained. A 
cellular grid was generated from the propidium iodide channel of the most median 
optical section using Merrysim”’. For each cell defined in the grid, we then selected 
the optical section passing through the centre of each nuclei. Fluorescence was 
then summed inside the corresponding cell from that section. The loss of fluor- 
escence due to tissue absorbance was also estimated using the spatial distribution 
of the propidium iodide channel and used to correct the fluorescence values. To 
obtain the changes in auxin levels along the root axis in the different tissues, 
fluorescence distribution was extracted and curvatures were smoothed using a 
Gaussian kernel with a sigma value of 5. 

For root live imaging, immediately after the beginning of the treatment, the 
seedlings were scanned every 2 min for 2h to follow the evolution of the DII- 
VENUS signal. To quantify fluorescence in the root tip, the average fluorescence 
intensity over identical scanned portion of the root (corresponding approximately 
to the first 200 tM from the root tip) was extracted using EZ-C1 software (v3.9, 
Nikon) and the values analysed using Microsoft Excel. For tissue specific kinetics, 
the seedlings were scanned every 5 min and fluorescence was extracted from 
groups of nuclei with nearly identical fluorescence (variations < 30%) in the dif- 
ferent tissues. The fluorescence intensity of nuclei was extracted from the different 
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tissues using the ROI tool of Fiji software (http://fijiisc/wiki/index.php/Fiji) 
and the values analysed using Microsoft Excel. For dose-dependent quantifica- 
tion of DII- VENUS signal upon auxin treatment, fluorescence was measured 1 h 
after treatment using two and three roots for wild-type and tirl afb1 afb2 afb3, 
respectively. 

For the vegetative shoot apex, seedlings were mounted into water in between 
slide and cover slip. For live imaging of vegetative shoot apex, five plants were 
treated after observation at t = 0 and followed over 90 min. For the inflorescence 
apex, observation was performed as described”*. Live imaging of the inflorescence 
apex was performed on plants grown on the auxin transport inhibitor NPA then 
transferred to a new medium without NPA as previously described”’. Initiation of 
a new organ was confirmed a posteriori by visual inspection of the apex. 

Root growth analysis, gravitropic assays and flower production rate. For ana- 
lysis of root growth, plants were grown near-vertically and root length was mea- 
sured at the indicated time. For gravitropic assays, plants were grown as detailed 
previously*° and imaged at 30 min intervals following a 90° gravi-stimulus. Root 
tip angle was measured using modified RootTrace software"! (http://www.cpib. 
ac.uk/tools-resources/roottrace). Estimation of flower production rate was done as 
described”. 

IAA28-Flag-TIR1 pull-down assays. To generate the 35S::Flag-TIR1 transgenic 
Arabidopsis line a plant expression vector containing a 3X Flag was first created by 
annealing complementary 101-base-pairs oligonucleotides including the 3 Flag 
coding sequences (see Supplementary Table 1) and cloning this fragment into 
Xbal and Sall sites of the vector pFP101. The Gateway Cl cassette (Invitrogen 
Gateway vector conversion reagent system) was then introduced into this plasmid 
by blunt-end ligation following Sall digestion and end-filling by Klenow reaction 
to create the destination vector pFP3FLAGSIL The full-length coding sequence for 
TIRI was amplified from an Arabidopsis cDNA library using Gateway-compatible 
primers (see Supplementary Table 1) and incorporated into the Gateway donor 
vector pDONR207 by BP reaction (Invitrogen). The TIR1 coding sequence was 
then incorporated into pFP3FLAGSII via a Gateway vector. LR reaction was used 
to form plasmid pFP3FLAGSII-TIRI. Wild-type Arabidopsis plants were sub- 
sequently transformed using the floral dip method” and homozygous lines with 
single-site were selected from the T; generation. Extracts of 10-day-old 35S::Flag- 
TIR1 seedlings were made as described previously® and used in pull-down assays 
by combining 2.5 mg of crude extract with 5 jg of biotinylated IAA28 domain II 
peptide (biotinyl- NH-EVAPVVGWPPVRSSRRN-COOH, synthesized by Thermo 
Scientific), and 65 pl 50% streptavidin-agarose suspension. The assays were incu- 
bated for 1h at 4°C with mixing then washed three times for 5 min in extraction 
buffer (0.15 M NaCl, 0.5% Nonidet P40, 0.1 M Tris-HCl pH 7.5, containing 1 mM 
phenylmethylsulphony! fluoride, 1 14M dithiothreitol, 10 1M MG132) containing 
the appropriate auxin treatment. The final processing of the pull-down assays includ- 
ing electrophoresis and western transfer were performed as described previously’. 
The immunodetection of TIR1/AFB-Flag was performed with a 1:5,000 dilution of 
anti-Flag 2-peroxidase (HRP) antibody (Sigma) followed by chemiluminescent 
detection with ECL plus reagents (Amersham). 

Transcript profiling. Total RNA was extracted from roots using an RNeasy Plant 
Micro Kit (Qiagen), including on-column DNase digestion to eliminate genomic 
DNA from the samples. A 500-ng aliquot of RNA was reverse-transcribed using a 
Transcriptor First Strand cDNA synthesis kit (Roche) and anchored-oligo (dT)18. 
Real-time qPCRs were performed on a Roche Light Cycler 480 system using the 
prevalidated single hydrolysis probes, Sensimix probe master mix (Quantace), and 
gene-specific primers (see Supplementary Table 1). 
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MATERIALS SCIENCE 


A fresh twist for self-assembly 


Molecular helicity affects many of the bulk properties of materials. A study finds that helicity also controls the self-assembly 
of colloidal particles, opening the door to a new generation of functional materials. SEE LETTER P.348 


VOLKER SCHALLER 
& ANDREAS R. BAUSCH 


r | Ahe next time you go to the super- 
market, take a look at the pasta. 
You'll probably find every- 

thing from long, thin spaghetti to 
butterfly-shaped farfalle and twisted 
fusilli. On closer inspection, you'll 
see that the strands of spaghetti 
readily align and pack closely together, 
whereas the packing of the fusilli is 
considerably more complex. This 
complexity is due to the fusilli’s chiral- 
ity — its helical geometry. On page 348 
of this issue, Gibaud et al. report that 
such complexity of packing can be 
exploited to control the self-assembly 
of nanometre-scale particles, allow- 
ing the reversible formation of various 
architectures*. 

To closely pack two individual pieces 
of fusilli, the pasta pieces have to twist 
with respect to each other so that their 
long axes are not parallel — this is an 
effect of their chirality. But when many 
fusilli are crammed together, this twist hinders 
the ability of the pasta to align in one direction, 
as required for efficient packing. The resulting 
competition between chirality and packing 
can have astonishing consequences for geo- 
metrically constrained chiral objects: if you 
stack fusilli upright in a beaker, the axes of 
the fusilli in the centre are perfectly vertical, 
whereas those at the edges of the beaker twist 
away from this alignment (Fig. 1). 

This balance between geometric constraints 
and chiral interactions is at the heart of the 
unique properties of chiral liquid crystals. 
In ‘blue’ liquid-crystal phases, for instance, 
chiral molecules self-assemble into cylinders 
of twisted molecules that stack in a cubic 
lattice, giving rise to vividly coloured and reflec- 
tive materials’. And in smectic liquid crystals, 
chiral interactions lead to a twisted, layered 
structure that is also manifested in the optical 
and mechanical properties of the materials’. 

Gibaud et al.’ examined the balance between 
chirality and geometric constraints in the self- 
assembly of colloidal particles. Their model 


“This article and the paper? under discussion were 
published online on 4 January 2012. 


Figure 1 | Pasta packing. When constrained in a circular container, 
fusilli pasta pieces mostly pack together so that their long axes 

are vertically aligned. But at the edges of the container, the pasta 
twists away from this alignment. This packing arrangement is a 
consequence of the pasta’s helicity (chirality). Gibaud et al.’ report 
that the chirality of colloidal particles affects the self-assembly of 
those particles. 


system was an ensemble of rod-like fd viruses, 
7 nanometres in diameter and 880 nm in 
length, suspended in water. The chiral- 
ity of these viruses depends on the ambient 
temperature: the rods are achiral at high tem- 
peratures, but chiral at low temperatures. The 
authors observed that, as for any other suspen- 
sion of colloidal particles’, the addition of an 
appropriate polymer that does not adsorb to 
the rods drove a phase separation of the mix- 
ture into a polymer-rich and a colloid-rich 
phase. At high temperatures, at which the rods 
are achiral, this led to the self-assembly of a 
circular colloidal membrane (a monolayer of 
vertically aligned rods’). 

The authors observed that the rods at the 
centre of the membrane readily align paral- 
lel to each other, but at the perimeter of the 
membrane they twist gradually away from 
the vertical. This twist at the margin mini- 
mizes the interface between the polymer-rich 
and the rod-rich phases, and so reduces the 
interfacial tension that arises from the imbal- 
ance of forces between particles in this region. 
But twisting rods out of alignment with the 
other rods in the membrane has its own 
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energetic cost, known as elastic energy. 
The formation of circular membranes 
with twisted margins is therefore 
the result of a trade-off between mini- 
mizing the phase interface and 
minimizing the elastic energy. 

But what happens if the rods are 
chiral, so that twisted packing is pre- 
ferred — just as it is for closely packed 
fusilli? Gibaud et al. addressed this 
question by performing experi- 
ments at lower temperatures, thereby 
‘switching on’ the chirality of the 
viruses. They observed that increases 
in chirality — that is, increases in the 
contribution of chiral interactions to 
the energy balance of the system — 
reduce the elastic energy, thus low- 
ering the energetic cost of creating 
a twist at the membrane’s margin. This 
destabilizes the edges of the circular 
membrane and triggers the formation 
of ribbon-like structures that splay 
out from the circular membrane (see 
Fig. 3a of the paper’). The effect of 
increasing the chirality of the viruses 
in the colloidal membrane is therefore similar 
to that of adding surfactants to oil-water mix- 
tures: it decreases the interfacial tension and so 
allows more interfaces to form. 

The beauty of the authors’ experimental set- 
up is that it allows all the parameters involved 
in the self-assembly of chiral colloids to be 
readily teased out and measured. Another 
advantage of the system is that the colloidal 
building blocks are much bigger than the 
molecules typically used in studies of chiral 
materials. This allowed Gibaud et al. to pre- 
cisely quantify the colloidal self-assembly 
process at all scales, from the microscopic 
movements of individual viruses to the macro- 
scopic properties of the resulting membrane. It 
also allowed them to exert well-defined local 
forces on the system, to affect the balance 
between twist and interfacial energy, and so to 
induce structural transitions — for example, 
by pulling on the membrane, they converted 
it into ribbon structures. 

By including chiral interactions in their 
system, Gibaud et al. have added a new degree 
of freedom to the self-assembly of colloidal 
particles. Transferring their results to other 


V. SCHALLER 


materials opens the door to the hierarchical 
assembly of functional materials that could 
react sensitively to changes in ambient con- 
ditions or to mechanical stimuli. One way to 
extend the functionality of their system could 
be to make colloidal particles that are not only 
chiral, but also have surfaces patterned with 
microscopic domains that attract or repel each 
other. Such patterned colloidal particles have 
recently been shown to self-assemble into useful 
functional structures such as ‘kagome lattices’. 

Chiral interactions between particles occur 
in all kinds of materials, from liquid crys- 
tals** to cytoskeletal filaments in cells (for 
which the hierarchical assembly of bundle- 
like structures is dependent on, and sensitive 
to, the helical twist of the filaments”*). Most 
of the chiral materials studied so far are 
passive or in thermal equilibrium. This means 
that their structural assembly is governed 
only by diffusion and by local interactions 


The path to 


between the constituent particles. But most 
naturally occurring materials are far from 
passive. Instead, they constantly consume 
energy so that their particles self-organize into 
higher-order structures — the cytoskeletons 
of cells are prime examples of this”””. 

The next step, therefore, is to apply the 
principles identified by Gibaud and colleagues’ 
to active systems. Imagine replacing the 
viruses used in this study with self-propelling 
chiral bacteria — would they still pack together 
in a two-dimensional membrane, and, if so, 
would the membrane move, or rotate, across 
mesoscopic or macroscopic distances? Could 
actively beating ribbons form, or would the 
noise stemming from active movement of the 
bacteria prevent the formation of such struc- 
tures? All we can say for certain is that chiral 
interactions will add a new twist not only to 
the self-assembly of colloids, but also to the 
self-organization of active materials. m 


retinoblastoma 


Genomic analyses of tumours of the childhood cancer retinoblastoma reveal a 
low mutation rate, challenging the view that genomic instability is crucial for its 
progression. The work also identifies a new therapeutic target. SEE ARTICLE 2.329 


JULIEN SAGE & MICHAEL L. CLEARY 


etinoblastoma is a rare tumour that 
Rites retinal cells in the eyes of 
children. Analyses of familial and 
sporadic cases of this cancer, backed by stud- 
ies in genetically engineered mice, have shown 
that loss of function of the tumour-suppressor 
protein RB1 (also known as RB) is required for 
the development of most, if not all, tumours 
of this type. However, it is not clear how RB1- 
deficient retinal cells progress to malignant 
tumour cells’. In addition, emerging evidence 
that loss of RB1 function can induce genomic 
instability” has raised the tantalizing possi- 
bility that RB1-deficient retinal cells might 
be predisposed to accumulating many addi- 
tional mutations, further complicating the 
identification of mutations that contribute to 
the development and maintenance of retino- 
blastoma. On page 329 of this issue, however, 
Zhang et al.’ demonstrate that retinoblastoma 
genomes have very few recurrent mutations in 
genes other than RB1. Instead, the expression 
of cancer-related genes is affected by epigenetic 
modifications on chromosomes, which do not 
affect DNA sequence but are inherited after 
cell division. 
To identify mutations that could cooperate 


with loss of RB1 function in tumour develop- 
ment, Zhang and colleagues’ sequenced and 
compared the genomes of normal tissue 
and retinoblastoma tumours from four 
patients. The researchers found that RB1 was 
the only known cancer-related gene consist- 
ently mutated, and that the retinoblastomas 
had 15-fold fewer total mutations than other 
types of solid tumour whose genomes have 
been sequenced’. 

Next, the authors searched for epigenetic 
alterations and for abnormal gene expression 
in retinoblastoma cells. They identified the 
gene that encodes the protein kinase enzyme 
SYK asa potential oncogene whose expression 
is consistently higher in retinoblastoma cells 
than in normal immature retinal cells. More- 
over, the activity of SYK was essential for the 
growth of retinoblastoma cells. The authors 
also show that certain small molecules that 
selectively inhibit SYK activity induce the 
death of retinoblastoma cells in a mouse model 
of the disease. 

These findings’ indicate that SYK may be 
a promising target for treating patients with 
retinoblastoma. SYK was not identified in 
previous searches for genes with a role in this 
cancer because it is not mutated or structur- 
ally rearranged in retinoblastoma. Therefore, 
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Zhang and colleagues’ study emphasizes the 
importance of high-throughput approaches 
that integrate genome sequencing with 
gene-expression analysis and epigenomics to 
identify cancer genes. 

Low mutation frequency has also been 
observed’ in medulloblastoma (a type of brain 
tumour that affects children), suggesting that 
it could be a general feature of childhood can- 
cers. A possible explanation for this difference 
between the cancers of children and adults is 
that childhood tumours arise in cells that are 
naturally undergoing rapid developmental 
growth, with fewer brakes on their proliferation 
than cells in adults. An alternative explanation 
is that, in children, these cells are negotiating 
crucial developmental checkpoints that are sus- 
ceptible to corruption, leading to incomplete 
or abnormal maturation. In both cases, only a 
few mutations would be needed to trigger the 
cellular changes associated with cancer. 

Furthermore, epigenetic changes in children 
and excessive mutations in adults may have 
similar roles in cancer development. Another 
childhood cancer, Wilms tumour, also has 
a relatively stable genome and displays an 
increased variation in DNA-methylation 
patterns compared with normal cells®. RB1- 
deficient retinal cells may be particularly 
susceptible to this tumour-formation mecha- 
nism, because RB1 interacts with the machin- 
ery that controls the epigenetic status of cells, 
including enzymes that remodel chromatin 
(DNA-protein complexes) and other enzymes 
that add methyl groups to DNA’. Thus, child- 
hood cancers such as retinoblastoma may 
carry epigenetic abnormalities that change 
gene expression and are stably propagated 
through subsequent cell divisions, helping to 
maintain tumour-associated features. 

If this model is correct, it is possible that 
RB1-deficient tumours in adults — such 
as small-cell lung cancer and some breast, 
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Genome-wide structure and organization 
of eukaryotic pre-initiation complexes 


Ho Sung Rhee! & B. Franklin Pugh! 


Transcription and regulation of genes originate from transcription pre-initiation complexes (PICs). Their structural and 
positional organization across eukaryotic genomes is unknown. Here we applied lambda exonuclease to chromatin 
immunoprecipitates (termed ChIP-exo) to examine the precise location of 6,045 PICs in Saccharomyces. PICs, 
including RNA polymerase II and protein complexes TFIIA, TFIIB, TFIID (or TBP), TFIIE, TFIIF, TFIIH and TFIIK were 
positioned within promoters and excluded from coding regions. Exonuclease patterns were in agreement with 
crystallographic models of the PIC, and were sufficiently precise to identify TATA-like elements at so-called 
TATA-less promoters. These PICs and their transcription start sites were positionally constrained at TFIID-engaged 
downstream +1 nucleosomes. At TATA-box-containing promoters, which are depleted of TFIID, a +1 nucleosome was 
positioned to be in competition with the PIC, which may allow greater latitude in start-site selection. Our genomic 
localization of messenger RNA and non-coding RNA PICs reveals that two PICs, in inverted orientation, may occupy the 
flanking borders of nucleosome-free regions. Their unambiguous detection may help distinguish bona fide genes from 


transcriptional noise. 


Assembly of the PIC and its post-assembly control are critical early 
steps in the transcription of eukaryotic genes. TBP (TATA-binding 
protein) arrives at most promoters as part of the multi-subunit TFIID 
complex that includes TBP-associated factors (TAFs)’. Together these 
proteins help recruit RNA polymerase (Pol) II and its entourage of 
general transcription factors (GTFs) to the transcription start sites 
(TSSs) of genes” *. These PICs assemble in nucleosome-free promoter 
regions (NFRs) that are flanked by an upstream —1 nucleosome anda 
downstream +1 nucleosome’. PICs have largely been defined bio- 
chemically using purified GTFs at a few model genes*~, but little is 
known about their assembly and organization in vivo, particularly at 
near-base-pair resolution on a genome-wide scale. 

An oddity of TBP is that when it is part of the TFIID complex, it 
tends to bind promoters that lack the TATA box consensus 
TATAWAWR (W indicates A/T; R indicates A/G)*. Approximately 
80-90% of all Saccharomyces genes are thus designated as “TATA- 
less’, and have a predominant PIC assembly mechanism and chro- 
matin architecture that differs substantially from those in the “TATA 
box class of genes®*”. So far, no TBP-binding motif has been identified 
at TATA-less promoters, and so the origins of TFIID-promoter 
specificity have been rather enigmatic'®. When TBP is not part of 
the TFIID complex, the SAGA complex directs TBP to TATA-box- 
containing Pol II promoters'!”’. 

TFIA and TFIIB clamp TBP to DNA, and make DNA contacts 
immediately upstream and downstream of the TATA box. TFIIB is a 
linchpin between TBP and Pol II'*"». Its intimate contact with Pol II 
directs how far downstream Pol II productively initiates transcrip- 
tion'®'’. TFIIF enhances the interaction of Pol II with TFIIB, assists 
in recruiting TFIIE, and promotes downstream elongation events*"*. 
TFIE then stimulates DNA strand separation by Pol Il at the TSS, and 
enhances the activity of TFITH. TFIIH holoenzyme is multi-functional, 
having ATP-dependent helicase (Ssl2 and Rad3) and kinase (Kin28) 
activities that reside on biochemically separable sub-complexes (TFIIH 
and TFIIK, respectively), both of which are key to efficient open com- 
plex formation and transcription initiation’. 


Weexamined the structural organization of PICs and their specificity 
on a genomic scale using ChIP-exo”’. This novel strategy substantially 
improved mapping resolution and eliminated many false positives. The 
exonuclease processively degrades a DNA strand in the 5'~3' direction 
until a crosslinking point is encountered (Supplementary Fig. 1a). The 
crosslinking inefficiency inherent to ChIP allows multiple crosslinking 
points to be detected in a population by deep sequencing. When applied 
to the GTFs on a genomic scale, we obtained detailed and comprehen- 
sive information on PIC structure and genomic organization. 


Genome-wide PIC structure 
We applied ChIP-exo genome-wide to Pol II and each GTF (Fig. 1a), 
and verified binding for TFIIB by locus-specific PCR using a series of 
tiled primers (Supplementary Fig. 1b). When exonuclease stop sites 
were mapped over all annotated mRNA promoters that contained a 
TATA box consensus, a distinctive pattern was observed (Fig. 1b). 
Importantly, each GTF displayed a strand-specific composite pattern 
of exonuclease stop sites that occurred only when TATA boxes, but 
not TSSs (Supplementary Fig. 2a and data not shown), were aligned, 
indicating that PICs are positioned with respect to the TATA box. 

For TFIIB, we detected four DNA crosslinking points (pairs of 
exonuclease stops), designated B1-B4 (Fig. 1c). The TATA box was 
precisely centred between B1 and B2, which were separated by 20 + 3 
base pairs (bp). Crosslinking point B3 and the diffuse B4 region indi- 
cate that TFIIB further crosslinked over a broad region downstream of 
the TATA box towards the TSS. We compared the four TFIIB cross- 
linking points to crystallographic-based models of open and closed 
TBP-TFIIB-Pol-II-promoter complexes (Fig. 1c)'*’°. An important 
caveat of the crystallographic models is they were built from multiple 
independent structures of truncated TFIIB-TBP-TATA, TFIIB-Pol 
II and Pol II elongation complexes. Thus, the combined structures 
represent a hypothetical organization. 

Within the modelled closed and open structures, crosslinking site 
B1 precisely (+ 3 bp) mapped to where the TFIIB carboxy-terminal 
core straddles the upstream DNA-binding stirrup of TBP (11 bp 
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Figure 1 | Genome-wide structural organization of PICs. a, Raw ChIP-exo 
tag distribution for GTFs and Pol II around the RPS14B gene. Filled plots 
represent unfiltered 5’ ends of sequencing tags on the sense (darker fill) and 
antisense strand (lighter fill). b, Average GTF and Pol II occupancy around the 
TATA box of 676 annotated mRNA genes. Plots are as in panel a. 


upstream of the TATA box midpoint). B2 mapped precisely (+ 3 bp) 
to where the TFIIB core amino-terminal cyclin fold encounters DNA 
just downstream of TBP’s other stirrup (9 bp downstream of the TATA 
box midpoint). B3 mapped to where the TFIIB linker region is closest to 
the DNA, which was 19 bp downstream of the TATA box midpoint. B4 
corresponded to a broad region defined by close proximity of the TFIIB 
reader (or finger) domain to single-stranded DNA within the modelled 
open complex, but was not evident within the closed complex. Similar 
broad regions of crosslinking were observed with the other GTFs 
(Fig. 1b), and may reflect indirect crosslinking. Support that these 
PICs represent open complexes is provided by permanganate reactivity 
studies of the GAL1, GAL10 and HSP82 loci****. Taken together, the 
entirety of the genomic crosslinking sites observed with the GTFs and 
Pol II fits remarkably well with the crystallographic models of the PIC 
open complex'*”*, and with many aspects of in vitro chemical cross- 
linking of these proteins’?**”’. 


TATA-like elements at TATA-less genes 

An apparent paradox of so-called TATA-less promoters is their 
utilization of TFIID, which has long been described as the TATA- 
box-binding complex*. However, it is unclear whether the TBP subunit 
of TFIID recognizes specific DNA sequences at TATA-less promoters. 
Inasmuch as TBP is expected to be found at all TATA-less promoters, 
and motif searching algorithms failed to identify candidate TBP-binding 
sites, we instead opted to search for sequence elements with up to two 
mismatches to the TATAWAWR consensus. We also limited our search 
to measured PIC locations. Remarkably, 99% of the PICs at TAT A-less 
promoters contained a sequence having two or less mismatches to the 
TATA box consensus (Fig. 2). We refer to these mismatched elements as 
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Figure 2 | Identification of TATA-like elements at TATA-less genes. Left, 
TFIIB occupancy around individual TFIIB-enriched TATA elements of mRNA 
genes (rows, n = 3,945), sorted by occupancy level. Occupancy on the sense 
(blue) and antisense (red) strands is shown with respect to TSS orientation. 
Right, a colour chart representation of the DNA sequence located + 20 bp from 
the TATA element midpoint and ordered as shown in the left panel. 
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“TATA-like’, as they did not form a consensus, whereas those conform- 
ing to the consensus retain the “TATA box’ designation. We refer to the 
two elements together as “TATA elements’. 

To assess whether TFIIB was positioned around these TATA-like 
elements in a canonical manner as seen at bona fide TATA boxes, 
strand-specific ChIP-exo tags were plotted around each element, sepa- 
rated into panels by 0, 1 or 2 mismatches to the TATA box consensus 
(Fig. 2). Notably, regardless of its occupancy level, the distribution of 
TFIIB crosslinking and thus its positioning relative to these TAT A-like 
elements was quite similar to the positioning observed at bona-fide 
TATA boxes. When the other GTFs were examined, their patterns 
relative to TATA-like elements were also similar to those found at 
TATA boxes (Supplementary Fig. 2b), although some downstream 
differences were observed (addressed later). Thus, as previously seen 
at the three yeast TATA-box-containing genes GALI, GAL10 and 
HSP82 (refs 23, 24), and in mammalian in vitro systems”®, at least the 
upstream portion of most PICs are positioned with respect to resident 
TATA elements nearly identically, regardless of their Pol II promoter 
classification as TATA-box containing or TATA-less. Although the 
‘TATA-less’ designation may be a misnomer, this class of genes is not 
simply a slight variation of the TATA class, but instead has predom- 
inant regulation by TFIID versus SAGA, positive versus negative regu- 
lation by chromatin, and lower plasticity of expression®”. 


TATA-less TSS positioning by nucleosomes 

Permanganate reactivity experiments detect open complex formation 
upstream of the TSS at the GALI, GAL10 and HSP82 TATA-box- 
containing promoters****. These and other studies'””’ have led to the 
notion that Pol II scans downstream from the open complex to find 
the TSS. In agreement with this being a general mechanism, we find 
that PICs of TATA-box-containing genes generally reside upstream 
of the TSS (Supplementary Fig. 2a). 

TAFs are largely depleted at TATA-box-containing promoters, 
although they are not entirely absent (Fig. 3a). The low level of Tafl 
that was present tended to be positioned similarly to TBP and other 
GTFs (Fig. 1b and Supplementary Fig. 2a, b). In contrast, Tafl- 
enriched/TATA-less promoters (which are related, as shown in 
Supplementary Fig. 3) showed additional interactions downstream 
of the TSS that exactly coincided with the size and location of the 
+1 nucleosome (Fig. 3a). Indeed, Tafl displayed a more uniform 
positioning pattern in relation to the TSS and +1 nucleosome than 
to TATA elements, which suggests that at least part of the TFIID 
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TAF complex engages and is positioned by the +1 nucleosome at 
TATA-less promoters. Consistent with this, Bdf1, which is considered 
to be a missing piece of Tafl (ref. 30), binds to the +1 nucleosome”. 
Furthermore, Bdfl showed a nearly identical ChIP-exo pattern to that 
of Tafl (Supplementary Fig. 4). TFI[D-nucleosomal interactions 
have also been reported in mammalian systems, although the details 
may differ’. 

If TFIID binds simultaneously to both the +1 nucleosome and a 
TATA element, then the intervening Pol II would seem to be fenced in 
by TFIID, thereby limiting its ability to scan DNA. This model pre- 
dicts that the TSS would reside closer to the TATA element and be 
positionally restricted relative to the +1 nucleosome, compared to 
Tafl-depleted/TATA-box-containing promoters. Indeed, the TSS at 
TATA-less promoters resided ~ 10-20 bp closer to the TATA element 
than at TATA-box-containing promoters (Supplementary Fig. 2a). 

We also compared the position of TATA elements and TSSs in 
relation to the +1 nucleosome. We separately examined individual 
Tafl-depleted/TATA-box-containing and Tafl-enriched/TATA-less 
promoters (Fig. 3b). Strikingly, at the Tafl-enriched promoters, 
the TSS was tightly positioned at the border between the 5’ NFR 
and the +1 nucleosome in comparison to Tafl-depleted/TATA- 
box-containing promoters. The latter had TSSs distributed across 
the adjacent nucleosome position, and these nucleosomes were rela- 
tively depleted compared to the Tafl-enriched class (Fig. 3a). 

Taken together, we interpret these observations as reflecting distinct 
functions of the + 1 nucleosome at the two classes of genes. Nucleosomes 
and PICs of the TFIID-enriched/TATA-less class might cooperatively 
assembly, in which case the nucleosome may be instructive for TSS 
selection by impeding Pol II scanning (Fig. 3c). In contrast, nucleosomes 
and PICs of the Tafl-depleted/TATA-box-containing class may com- 
petitively assemble. This may allow for the greater stochasticity or plas- 
ticity of expression that is characteristic of this class’, in which 
nucleosome loss would prime the gene for a high level of transcription. 
A nucleosome competition mechanism removes an impediment to Pol 
II scanning. Pol II could scan further, thereby allowing productive ini- 
tiation at specific DNA elements”. The transition into a scanning state 
may be rate limiting in the scanning cycle, as the PIC is detected 
upstream of the TSS. 


GTF depletion in genes and their termini 


Although it is clear that GIFs assemble in promoter regions of mRNA 
genes and disengage Pol II within ~150 bp of the TSS**”, it is less 
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Figure 3 | PIC organization in relation to TFIID and the +1 nucleosome. 
a, GTF occupancy around the nearest nucleosome position (essentially +1) to 
an mRNA PIC, which were sorted by the distance between the two. Unfiltered 
tags on each strand were shifted in the 3’ direction by a fixed distance (~8 bp 
depending on each GTF, 73 bp for nucleosomes), so as to reflect better the 
points of crosslinking. Tafl-depleted (Tafl-depl.) and Tafl-enriched genes 
were determined as being distinct clusters when GTF occupancies of all genes 
were clustered by k-means (see Fig. 5a). For all graphs of this type, image 


resolution is less than the number of rows, resulting in some averaging and thus 
the appearance of less variance across adjacent rows. See Supplementary Data 2 
for underlying values, which can be visualized in Treeview. RP indicates 
ribosomal protein genes. The right panel shows transcription frequency**. The 
nucleosome borders are denoted by vertical dashed black lines. b, Same as panel 
a, but showing an overlay of TATA elements, TFIIB and TSS. c, Model of PIC 
organization at TATA-box-containing and TATA-less/TFIID-dependent 
genes. 


00 MONTH 2012 | VOL 000 | NATURE | 3 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


clear to what extent they assemble across the body of genes, or at genes 
that are either transcriptionally silent or classified as ‘dubious’. A 
whole genome view of GTFs and Pol II is presented in Fig. 4a. 
Remarkably, PICs were almost entirely excluded from coding regions, 
regardless of gene activity, whereas Pol II was enriched across gene 
bodies, as expected. Approximately 90% of dubious open reading 
frames (ORFs) lacked a canonical PIC organization or contained PICs 
within the ORF, and thus are unlikely to be coding. Thus, coding region 
PIC-driven initiation, whether in the sense or antisense direction, is 
infrequent on the scale of what is seen at mRNA promoters. 
Moreover, the observed GTF pattern makes clear that Pol II disengages 
all GTFs at the promoter. 

Much less is known of the fate of Pol II at the ends of genes, as it 
undergoes termination. To examine the 3’ ends of genes, without the 
complications associated with nearby mRNA promoters, we sepa- 
rated 3’ ends into those having nearby 3’ or 5’ ends of an adjacent 
gene (Fig. 4b). Within terminal intergenic regions, GIFs were highly 


depleted, indicating that PICs rarely exist at the 3’ ends of genes at 
levels seen for mRNA genes (although lower levels do exist). 

Remarkably, we find a highly correlated enrichment of Pol II and 
TFHH (Ssl2) but not TFIIK (Kin28), at the end of genes within 3’ 
NFRs (Fig. 4b). Such a physical separation of the TFIIH/Ssl2 and 
TFIHK/Kin28 submodules of holo-TFIIH in genome-wide binding 
experiments has not previously been reported, but may be in accord 
with their biochemistry’**'. However, Ssl2 is biochemically separable 
from the TFHH core, which therefore prompted us to examine 
additional core TFIIH subunits, Ssll and Tfb1. Surprisingly, both 
were absent from the ends of genes (Supplementary Fig. 6), although 
they were present within PICs at promoters. These results suggest that 
Ssl2, a 3'-5’ helicase, operates independently of holo-TFIIH at the 
ends of genes. Consistent with a possible role of Ssl2 in transcription 
termination, Ssl2 has functional interactions with the Hsp90 protein 
chaperone**, which has been implicated in the disassembly of the 
transcription machinery”. 
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Figure 4 | Genomic view of PICs in relation to genes. a, GIF occupancy 
around transcript and ORE start sites’’, sorted by gene length. See 
Supplementary Data 3-5 for underlying values. Transcript or ORF ends are 
indicated by black dashed and solid lines, respectively. The right panel shows 
transcription frequency**. b, GTF distribution around the 3’ ends of mRNA 
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Distance from annotated TSS (kb) 


genes, sorted by intergenic length, and sectioned by convergent versus tandem 
gene-pairs. Occupancy at eight reported looped genes*”*® is shown in 
Supplementary Fig. 5. c, GIF distribution around the TSS of mRNA genes, 
sorted by intergenic length, and sectioned by inverted versus tandem gene- 
pairs. 


©2012 Macmillan Publishers Limited. All rights reserved 


Divergent transcription from distinct PICs 

In contrast to coding regions, PICs were abundant in intergenic 
regions, far beyond what could be accounted for at mRNA promoters 
(Fig. 4c). Divergent transcription, in which mRNA and non-coding 
(nc)RNA initiation occurs within the same region but elongates in 
opposite directions, is well established in eukaryotes***!. However, it 
has been unclear whether divergent transcription originates from the 
same PIC site. Conceivably, the entire PIC or a portion thereof might 
assemble in either direction. As shown in Fig. 4c, even the shortest 
(~120 bp) 5’ intergenic regions of mRNA genes with inverted orienta- 
tion were associated with two PICs, one for each mRNA direction. 
Thus, divergent mRNA transcription originates from two distinct 
PICs, even when arising from the same NFR. 

We next examined the composition and location of PICs associated 
with mRNA and ncRNA (variously classified as cryptic unstable tran- 
scripts (CUTs), stable uncharacterized transcripts (SUTs), Xrn1- 
sensitive unstable transcripts (XUTs) and antisense (AS))**“*. We 
also examined orphan PICs, which we defined as being >160 bp from 
any annotated TSS or ORF start site. Nearly all had the same relative 
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composition of GTFs, including Tafl depletion or enrichment 
(Fig. 5a), although mRNA PICs generally had higher occupancy 
levels. GIFs had highly correlated occupancies at all PICs (see Sup- 
plementary Fig. 7 for mRNA PICs). ncRNA PICs were generally 
organized around an adjacent nucleosome, as seen for mRNA PICs 
(Supplementary Fig. 8). Thus, all mRNA, ncRNA and orphan PICs 
are compositionally homogeneous with regards to the GTFs (exclud- 
ing TAFs). 

To visualize better the context of the low-occupancy ncRNA PICs 
with mRNA genes, we marked ncRNA PIC locations by their TATA 
element, and plotted their directionality with respect to nearby 
mRNA (Fig. 5b). We observed a general trend where ncRNA and 
mRNA PICs were positioned in opposite directions 150-200 bp apart. 
This places both PICs within the same NFR, and thus within the same 
canonical nucleosome architecture, as seen for two divergent mRNA 
PICs that share the same NFR (Supplementary Fig. 9). 

Low-occupancy ncRNA PICs were also found towards the 3’ ends 
of mRNA genes, of which the majority were antisense to the mRNA 
(Fig. 5b), and associated with low expression of the sense mRNA 
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Figure 5 | Distribution of ncRNA PICs. a, GTF occupancy levels at PICs for 
mRNA, ncRNA and orphans, sectioned by k-means clustering. Occupancy 
levels of each GTF were median normalized, log, transformed, then sorted by 
row median. The small M1 group represents a terminating polymerase 
originating from upstream. b, Distribution of ncRNA PICs relative to mRNA 


Distance from mRNA TSS (kb) 


genes. mRNA genes were filtered to retain only those having a nearby ncRNA- 
associated PIC (defined as having >10% of the genome-wide TFIIB average). 
Plotted are the locations of TATA elements associated with the mRNA (grey), 
sense-directed ncRNA (blue), and antisense-directed ncRNA (red). Additional 
plot details are as described in Fig. 4. 
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(Supplementary Fig. 10a). Thus, ncRNAs, which tend to be anti- 
sense”, are generally associated with repression when residing in gene 
bodies. 

In total, we identified ~6,000 PICs in rapidly growing yeast cells, in 
which the PICs had an occupancy level of >10% of the genome 
average. More than 98% of these PICs had a TATA element precisely 
where TBP bound. Approximately 70% of the identified PICs were 
associated with mRNA genes (Supplementary Fig. 10b). The remain- 
ing ~30% were divided evenly between ncRNA and orphans. At 
lower detection thresholds, many more low-occupancy PICs could 
be identified. We do not believe that they represent technical noise, as 
they are highly enriched in NFRs in which mRNA and ncRNA PICs 
are found. They might produce low levels of promoter-specific basal 
transcription. 


Unifying principles of PICs 


Our data suggest that with the exception of TAFs, PICs are compo- 
sitionally homogeneous in regards to GTFs at coding and non-coding 
Pol II transcription units in the yeast genome. PICs differ markedly in 
occupancy levels, which is in accord with their transcription fre- 
quency. PICs tend to form at NFR/nucleosome borders at the 5’ 
end (and to some extent at the 3’ end) of genes, where they direct 
either mRNA or ncRNA transcription away from the NFR. As such, 
an NFR may normally accommodate two divergently oriented PICs at 
markedly different occupancy and transcription levels. These occu- 
pancy levels do not strictly correlate, which suggests largely independ- 
ent control of the two PICs. 

PIC assembly, orientation and positioning may be contributed to in 
part by the resident TATA element, as well as through sequence- 
specific factors and co-factors. TFIID-regulated promoters may rely 
less on TATA element strength, and more on an NFR-adjacent 
nucleosome for PIC assembly, orientation and positioning. The adja- 
cent nucleosome might also serve to impede a scanning Pol II so that it 
can productively select a TSS at a focused position just inside the 
nucleosome. At SAGA-regulated promoters, which tend to contain 
a consensus TATA box, nucleosome occupancy may be more com- 
petitive with PIC assembly, wherein the strength of TBP/TATA inter- 
actions would be more important for PIC assembly, orientation and 
positioning. As such, there would be no nearby nucleosome to impede 
polymerase scanning, which would allow TSS selection to be con- 
trolled by other factors, including DNA sequence. 

The emergent concept of ncRNA and the difficulty of distinguishing 
random transcriptional noise from specific initiation raise the question 
as to what constitutes a gene*. The unambiguous and precise mapping 
of PIC locations across a genome, as described here, might help define 
the start of individual genes. 


METHODS SUMMARY 


Saccharomyces strains (BY4741) bearing TAP-tagged GTF or Pol II subunits (or 
untagged TBP) were grown to exponential phase in yeast extract peptone dextrose 
(YPD) media (30°C to OD¢oonm = 0.8), then subjected to 1% formaldehyde 
crosslinking for 15min at 25°C. Cells were harvested and washed. Sonicated 
chromatin was prepared by standard methods. Standard ChIP methods were 
used, followed by lambda exonuclease treatment and library construction, as 
described elsewhere”. Libraries were sequenced by an ABI SOLiD sequencer. 
Figures displaying strand-specific sequencing tags represent the raw data without 
normalization to input. TFIIB peak calls were made with GeneTrack software**. 
PICs (m = 6,045) were identified as having a TFIIB peak-pair in at least two out of 
four biological replicates and having =33 sequence tags (> 10% of average TFIIB 
occupancy)”. PICs were assigned to the nearest TSS within + 200 bp, with 
mRNA” having precedence over ncRNA. For this purpose, ORFs lacking a 
TSS (from the Saccharomyces Genome Database) were assigned a hypothetical 
TSS based on the genome-wide consensus. PICs of ncRNA were assigned to the 
nearest TSS within + 200 bp of SUTs, CUTs, ASs and XUTs*™*, with SUTs/ 
CUTs having precedence over AS/XUTs. To assign directionality to orphan PICs, 
we compared nucleosome occupancy on the lower versus higher coordinate side 
of TFIIB locations. If the higher coordinate had higher nucleosome occupancy it 
was classified as ‘sense’, otherwise it was ‘antisense’. We validated this method by 
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applying it to mRNA PICs, and found >91% of the assignments to be correct. We 
searched for TATA elements between —80 to +20 bp of the midpoint of 6,045 
TFIIB-bound locations on the sense strand, first by searching for the consensus 
TATAWAWR, then for one and then two mismatches to the consensus. The 
TATA element closest to —28 bp of a TFIIB peak had precedence if multiple 
elements were found. 
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DNA breaks and chromosome 
pulverization from errors in mitosis 
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The involvement of whole-chromosome aneuploidy in tumorigenesis is the subject of debate, in large part because of the 
lack of insight into underlying mechanisms. Here we identify a mechanism by which errors in mitotic chromosome 
segregation generate DNA breaks via the formation of structures called micronuclei. Whole-chromosome-containing 
micronuclei form when mitotic errors produce lagging chromosomes. We tracked the fate of newly generated 
micronuclei and found that they undergo defective and asynchronous DNA replication, resulting in DNA damage and 
often extensive fragmentation of the chromosome in the micronucleus. Micronuclei can persist in cells over several 
generations but the chromosome in the micronucleus can also be distributed to daughter nuclei. Thus, chromosome 
segregation errors potentially lead to mutations and chromosome rearrangements that can integrate into the genome. 
Pulverization of chromosomes in micronuclei may also be one explanation for ‘chromothripsis’ in cancer and 
developmental disorders, where isolated chromosomes or chromosome arms undergo massive local DNA breakage 


and rearrangement. 


Whole-chromosome aneuploidy is a major feature of cancer genomes, 
yet its role in tumour development remains controversial'’. This con- 
trasts with chromosome breaks and rearrangements, which are known 
to produce cancer-causing mutations. Recent genetic evidence demon- 
strates that increased rates of whole-chromosome mis-segregation can 
accelerate oncogenesis* °; however, the only established mechanism by 
which whole-chromosome segregation errors promote tumorigenesis 
is by facilitating the loss of heterozygosity for tumour suppressors’. 
Intriguingly, two animal models where whole-chromosome segrega- 
tion errors result in robust tumour development also display extensive 
structural alterations in chromosomes®*. This raises the question of 
whether errors in mitosis can predispose to DNA damage. We con- 
sidered the possibility that segregation of chromosomes into micro- 
nuclei might produce DNA damage. Whole-chromosome-containing 
micronuclei form from anaphase lagging chromosomes”’; micronuclei 
can also be generated from acentric fragments of chromosomes". 
Micronuclei have many features of primary nuclei, but much contro- 
versy surrounds their actual composition and functional properties. 
Studies differ on whether micronuclei are transcriptionally active, rep- 
licate DNA, mount a normal DNA damage response, or assemble 
normal nuclear envelopes; moreover, the ultimate fate of chromosomes 
trapped within micronuclei remains unclear’!"*"°. 


DNA damage in micronuclei 


To determine if newly formed whole-chromosome micronuclei develop 
DNA damage, we generated micronuclei in synchronized cells and 
tracked them through the cell cycle. As a first synchronization approach, 
micronuclei were generated in non-transformed RPE-1 and trans- 
formed U2OS cells by release from nocodazole-induced microtubule 
depolymerization. When mitotic cells are released from nocodazole, 
spindles reassemble abnormally, producing merotelic kinetochore 
attachments (one kinetochore attached to two opposite spindle poles), 
lagging chromosomes, and ~10% of cells with micronuclei’®. Because 


prolonged mitotic arrest causes DNA damage’, RPE-1 cells were 
arrested for a short (6h) interval with nocodazole. After release from 
the 6h nocodazole block, neither the primary nuclei nor the newly 
formed kinetochore-positive micronuclei showed significant DNA 
damage during the subsequent G1 phase as measured by damage- 
dependent phosphorylation of the histone variant H2AX”° (y-H2AX 
foci formation; Fig. la~c and Supplementary Fig. 2a-f), TdT-mediated 
dUTP nick end labelling (TUNEL) to detect DNA breaks”! (Fig. 1d and 
Supplementary Fig. 2e, f), and visualization of DNA breaks with the 
comet assay” (Supplementary Fig. 3d, e). 

Because aneuploidy can produce a p53-dependent G1 cell-cycle 
arrest****, p53 was silenced by RNA interference (RNAi) to allow us 
to monitor the fate of micronucleated RPE-1 cells at later stages of the 
cell cycle. As expected, after S-phase entry, low-level DNA damage 
was detected in both the micronuclei and the primary nucleus”; 
however, in G2 phase cells, most micronuclei showed DNA damage 
whereas almost none was detected in the primary nucleus (Fig. la—d). 
Similar results were observed in U2OS cells (Supplementary Fig. 3) 
and in cells where merotelic kinetochore-microtubule attachments 
were generated by knockdown of the kinetochore-associated micro- 
tubule depolymerase MCAK”® or the kinetochore protein NUF2 
(ref. 27) (Supplementary Fig. 4a—c). MCAK knockdown does not 
delay cells in mitosis, demonstrating that the acquisition of DNA 
damage in micronuclei is independent of mitotic arrest'”°. This 
damage did not represent activation of apoptosis because it was not 
accompanied by caspase-3 activation, and it was not suppressed by a 
pan-caspase inhibitor (Supplementary Fig. 5). 

Asa completely independent method for generating micronuclei, we 
used a human cell line (HT1080) carrying a human artificial chro- 
mosome (HAC) with a kinetochore that could be conditionally inacti- 
vated”*. In this system, kinetochore assembly on the HAC is blocked by 
washout of doxycycline from the medium; consequently, the HAC is 
unable to attach to the mitotic spindle and is left behind at anaphase, 
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Figure 1 | Micronuclei from lagging chromosomes develop DNA breaks. 

a, Schematic of the experiment. b, Representative images of micronucleated G1 
(6h), S (16h), G2 (22 h) and irradiated RPE-1 cells (IR) labelled for DNA (blue), 
centromeres (ACA, red) and y-H2AX (green). Insets show enlarged images of 
micronuclei. Scale bars, 10 jum. c, d, Percentage of primary nuclei (PN, blue 
bars) and centric micronuclei (MN, red bars) with y-H2AX foci (c) and TUNEL 
labelling (d) (3 experiments, m = 100). Errors bars indicate s.e.m. 


ultimately reforming as a micronucleus (Fig. 2a). Results obtained with 
the HAC-containing micronuclei (Fig. 2b-d) were in agreement with 
the results obtained by the other synchronization methods described 
above. Taken together, micronuclei do not exhibit significant DNA 
damage during Gl, but a large fraction of micronuclei acquire DNA 
damage during S phase that persists into G2. 


Defective DNA replication in micronuclei 


To determine directly whether acquisition of DNA damage in micro- 
nuclei requires DNA replication, synchronized micronucleated cells 
were released into medium containing thymidine to block DNA rep- 
lication. Blocking DNA replication with thymidine abolished the 
acquisition of DNA damage (Fig. 3a-c), demonstrating that the 
breaks in micronuclei occur in a replication-dependent manner. 

We next tested whether the acquisition of DNA damage in micro- 
nuclei is mediated by aberrant DNA replication. This possibility was 
suggested because micronuclei in G2 cells that had y-H2AX foci were 
almost always additionally labelled for phosphorylated RPA2 (Ser 33)”, 
a marker of DNA replication stress (Supplementary Fig. 4d). Moreover, 
the characteristic 5-bromodeoxyuridine (BrdU) labelling patterns of 
early, mid and late S-phase cells” suggested that DNA replication in 
micronuclei is inefficient and asynchronous with the primary nucleus 
(Fig. 3d). By measuring the fluorescence intensity of BrdU relative to 
DNA area from single focal plane confocal images, DNA replication in 
micronuclei was detected at a markedly reduced level throughout S 
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phase when compared to the primary nuclei (Fig. 3e). Notably, pulse- 
labelling with BrdU demonstrated asynchronous DNA replication of 
the micronuclei relative to the primary nucleus, with ~30% of micro- 
nuclei replicating DNA in G2 cells (22 h; Fig. 3d). 

The compromised DNA replication observed in newly formed 
micronuclei led us to test whether complexes necessary to license 
DNA replication origins are recruited to chromosomes in micronuclei. 
The first step in assembling replication origins is the loading of the 
origin recognition complex (ORC) after chromosome decondensation 
during telophase*’. Quantitative fluorescence imaging suggested that 
ORC recruitment is equally efficient in the micronuclei as in the 
primary nucleus (Supplementary Fig. 6a). By contrast, newly generated 
micronuclei showed significant reduction in the recruitment of repli- 
cative DNA helicase components MCM2 and MCM3 (refs 32, 33) as 
well as the replication initiation factor CDT1 (ref. 31) (Fig. 4a, b and 
Supplementary Fig. 6b, c). We observed an increased recruitment of 
MCM subunits into micronuclei as cells progressed through the cell 
cycle, but the levels of MCM subunits never exceeded 20% of that 
observed in the primary nucleus (Fig. 4b). 

To examine whether the DNA damage response in micronuclei was 
also abnormal, cells were treated with replication inhibitor (aphidicolin) 
or exposed to 2 Gy irradiation to induce damage and then labelled to 
detect components of the DNA damage response. After aphidicolin 
treatment, micronuclei showed clear signal for y-H2AX and its kinase 
ATR (ataxia telangectasia mutated-related), but downstream constitu- 
ents of the DNA damage response such as 53BP1 were not efficiently 
recruited, as previously reported** (Supplementary Fig. 7). A similar 
defect in the recruitment of DNA-damage-response factors into micro- 
nuclei was also observed after irradiation (Supplementary Fig. 8a). This 
recruitment defect had clear consequences for DNA damage repair: 
whereas -H2AX foci were resolved in primary nuclei by 6h after 
2 Gy irradiation, y-H2AX foci persisted in micronuclei for >24h 
(Supplementary Fig. 8b). The failure of micronuclei to recruit MCM 
subunits and components of the DNA repair machinery may be due, 
at least in part, to a defect in nucleocytoplasmic transport. Newly formed 
micronuclei have a marked reduction in the density of nuclear 
pores'''*!> (Supplementary Fig. 9) and exhibit strongly reduced nuclear 
import (Fig. 4c, d and Supplementary Fig. 10). 


Micronuclei and chromosome pulverization 


Next we tested the prediction that abnormal DNA replication in 
micronuclei could generate chromosome breaks. Chromosome 
spreads were prepared from non-transformed cells in the first cell cycle 
after release from nocodazole or from DMSO-treated controls 
(Fig. 5a-f and Supplementary Fig. 11a). Notably, 7.6% of chromosome 
spreads prepared from micronucleated cells released from nocodazole 
block (n = 722) exhibited chromosomes with a ‘pulverized’ appear- 
ance, characterized by fragments of 4',6-diamidino-2-phenylindole 
(DAPI)-stained material that were often clustered into a discrete loca- 
tion on the spread* (Fig. 5a, d (yellow arrows), i and Supplementary 
Fig. 11b). By contrast, pulverization was exceedingly rare (0.14%) in 
controls. Multicolour fluorescence chromosome painting (SKY) con- 
firmed that 34 of 47 spreads with pulverized chromosomes were 
composed of fragments from a single chromosome and another eight 
were from two chromosomes (Fig. 5a-i and Supplementary Fig. 11). 
Despite this marked disruption to chromosomal structure, only one 
cell harboured non-reciprocal chromosome translocations 24h after 
nocodazole release (1 = 101), whereas no translocations were observed 
in control cells (data not shown). At 72h after nocodazole release 
(n = 100), we still did not detect chromosome translocations, but we 
did detect truncated or derivative single chromosomes in three cells. 
Although our experiments show that whole-chromosome mis- 
segregation does not produce frequent non-reciprocal translocations, 
we do not exclude the possibility that the DNA damage acquired in 
micronuclei could result in translocations, albeit at a frequency lower 
than ~1/100 per generation. 
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Figure 2 | DNA breaks in a HAC targeted to a micronucleus. a, Right: 
schematic; left: fluorescence in situ hybridization images of HAC (red) in a 
primary nucleus (+Dox) or micronucleus (— Dox). b, Images of 
micronucleated cells as in Fig. 1b (enlarged and brightened in insets). Scale bars, 


Chromosome pulverization has been observed in cell-fusion 
experiments where chromosomes from an S-phase cell are pulverized 
because of exposure to signals from mitotic cytoplasm”. It has also 
been observed when aberrant late-replicating chromosome transloca- 
tions are generated’’. The mechanism of pulverization involves com- 
paction of partially replicated chromosomes induced by mitotic 
cyclin-dependent kinase activity, termed premature chromosome 
compaction*****’. By labelling micronucleated and control cells for 
2h with BrdU immediately before chromosome condensation and 
mitotic arrest, we found that 25 of 30 pulverized chromosomes were 
undergoing DNA replication (Fig. 5h). The fact that only micronuclei 
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10 um. ¢, d, Percentage of primary nuclei (blue bars) and centric micronuclei 
(red bars) with y-H2AX foci (c) and TUNEL labelling (d) (3 experiments, 
n= 100). Errors bars indicate s.e.m. 


incorporate BrdU in G2 cells (Figs 3d and 5g) demonstrates that the 
pulverized chromosomes are derived from micronuclei. 


The fate of chromosomes in micronuclei 


We next addressed the potential for chromosome aberrations 
acquired in micronuclei to be reincorporated into the genome. We 
used long-term live-cell imaging to determine the fate of sponta- 
neously arising or experimentally induced micronuclei in a variety 
of cell lines (Supplementary Table 1). Imaging analysis revealed that 
the vast majority of micronuclei (525 of 541; 97%) are stably main- 
tained during interphase. Although micronuclei may be extruded in 
some cells’, extrusion was not detected in our imaging experiments. 
Furthermore, micronuclei were not visibly degraded”, did not co- 
localize with lysosomes (Supplementary Fig. 12), and did not appear 
to fuse with the primary nucleus. By contrast, our experiments indi- 
cated that after nuclear envelope breakdown some micronuclei might 
join the other mitotic chromosomes and be distributed to daughter 
nuclei. Of the 331 micronucleated cells that were tracked as they 
underwent cell division, 16% (53 of 331) gave rise to daughter cells 


Figure 3 | DNA damage in micronuclei results from aberrant DNA 
replication. a—c, DNA replication requirement for acquisition of DNA damage 
in micronuclei. a, RPE-1 cells were synchronized as in Fig. 1a and released into 
media with (+TH) or without (—TH) 2 mM thymidine. Cells were co-labelled 
for TUNEL (green) and cyclin B1 (red). Shown are images from the 22h 
sample. Scale bar, 10 um. b, c, Percentage of TUNEL-positive primary nuclei 
(blue bars) and micronuclei (red bars) with (c) or without (b) thymidine 
treatment; (3 experiments, m = 100). Errors bars indicate s.e.m. d, Inefficient 
and asynchronous DNA replication in micronuclei. RPE-1 cells as in Fig. la 
were pulse-labelled with BrdU and labelled: DNA (white) and BrdU (red). 

e, The ratio of BrdU incorporation in micronuclei relative to primary nuclei 
after a 30-min pulse label. Normalized fluorescence intensity (FI) 
measurements are as shown in the box and whisker plots (2 experiments, 
n= 50). Boxes represent upper and lower quartiles, lines within boxes 
represent median, and the error bars comprise the whiskers which extend to the 
maximum and minimum value data sets. 
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Figure 4 | Defective MCM2-7 complex recruitment, DNA damage response 
and nucleocytoplasmic transport in micronuclei. a, Impaired MCM2 
recruitment into micronuclei. G1 RPE-1 cells were synchronized as in Fig. la 
and stained for chromatin-bound MCM2 and DNA. Scale bar, 10 um. 

b, Relative MCM2 fluorescence intensity (FI) as in Fig. 3e. Approximate cell 
cycle stage of time points: 6 h, G1; 10h, early S phase;16 h, mid S phase; 20 h, late 
S phase; (2 experiments, n = 50). c, d, Micronuclei are partially defective for 


Figure 5 | The fate of chromosomes in micronuclei. Micronuclei were 
induced in RPE-1 cells as described in Fig. 1a (after p53 knockdown) and 
chromosome spreads were prepared 24h after nocodazole release. 

a-c, Pulverization of chromosome 1 demonstrated by DAPI staining 
(chromosome fragments, brightened in inset, yellow arrows) (a), SKY probes 
(pseudo-coloured) (b), and aligned SKY karyotype (c). d-f, Pulverization of 
chromosome 16, viewed as in a-c. g, A BrdU-positive micronuclei-containing 
G2 RPE-1 cell. 2h BrdU pulse label. DNA, white; BrdU, red. h, Selective BrdU 
labelling of a pulverized chromosome. i, The percentage of cells with intact 
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nuclear import of NFATc1-EGFP. c, Representative images of micronucleated 
U20S cells stably expressing NFATc1-EGFP in G1 and G2 with or without 
treatment with 0.2 11M thapsigargin for 10 min. d, Ratio of NEFATcl1-EGFP 
fluorescence intensity between micronuclei and primary nuclei; (3 
experiments, m = 30). For b and d, error bars comprise the whiskers which 
extend to the maximum and minimum value data sets. 
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micronuclei (blue bars) or pulverized chromosomes (PC, red bars) from 
control or nocodazole-released (NOC) RPE-1 cells. Interphase is 18-h sample 
and metaphase is 24h. j, Images from live-cell experiment showing a pre- 
converted green fluorescent micronucleus (white arrows) photo-converted to a 
red-fluorescent micronucleus (yellow arrows) imaged through mitosis. After 
anaphase, the micronucleus either reincorporated with the primary nucleus or 
failed to reincorporate and reformed as a micronucleus. Top row: 
reincorporation of micronuclei into primary nucleus; bottom row: no 
reincorporation. Time is shown as h:min. 
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that lacked micronuclei (Supplementary Table 1 and Supplementary 
Movies 1 and 2). 

To test directly whether micronuclei reincorporate during mitosis, 
we generated a U2OS cell line stably expressing H2B fused to the photo- 
convertible fluorescent protein Kaede*’, which converts from green to 
red fluorescence emission after ultraviolet illumination. This enabled us 
to photoactivate selectively and subsequently track a single chro- 
mosome encapsulated in a micronucleus throughout mitosis (Fig. 5j 
and Supplementary Movies 3 and 4). This experiment demonstrated 
that chromosomes within micronuclei reincorporate into daughter 
nuclei at a significant frequency (11 of 29; 38%). The remaining micro- 
nuclei persisted in cells well into the second generation. 


Discussion 

Together, our findings indicate a novel mechanism by which whole- 
chromosome segregation errors can cause chromosome breaks and 
potentially cancer-causing mutations. Mitotic errors can lead to lagging 
chromosomes which in turn can be partitioned into micronuclei’. This 
can produce DNA damage in two ways. First, DNA damage can result 
as a direct consequence of aberrant DNA replication, potentially due 
to a reduced density of replication origins. Second, chromosome pul- 
verization due to premature chromosome compaction can occur if a 
micronucleated cell enters mitosis with micronuclei still undergoing 
DNA replication. This may not be the only mechanism by which 
whole-chromosome mis-segregation can cause DNA damage, as it 
was recently proposed that lagging chromosomes can be damaged by 
the cytokinetic furrow“. In contrast to previous studies that have sug- 
gested that most micronuclei are inactive, discarded or degraded", we 
find that micronuclei persist over several generations but that the chro- 
mosomes within the micronuclei can be segregated at a significant 
frequency into daughter cell nuclei. Thus, DNA rearrangements and 
mutations acquired in micronuclei could be incorporated into the 
genome of a developing cancer cell. 

Our experiments suggest a mechanism that could explain the recently 
discovered phenomenon of chromothripsis**’. Chromothripsis was 
discovered by cancer genome sequencing and DNA copy number ana- 
lysis. It is defined by small-scale DNA copy number changes and 
extensive intrachromosomal rearrangements that are restricted to a 
single chromosome or chromosome arm. Two non-exclusive models 
have been proposed for chromothripsis are: (1) fragmentation of a 
chromosome followed by stitching together of the resulting fragments 
by non-homologous end joining’; and (2) aberrant DNA replication 
resulting in fork stalling and template switching or microhomology- 
mediated break-induced replication”. Partitioning of a chromosome 
into a spatially isolated micronucleus can explain why a single chro- 
mosome would be subject to aberrant DNA replication and can explain 
how extensive DNA damage can be restricted to a single chromosome. 
Because micronuclei may persist as discrete entities over several 
generations, damage to the chromosome need not occur all at once**”’, 
but could accrue from multiple failed attempts at DNA replication. 
Chromothripsis restricted to a chromosome arm could be generated 
from acentric chromosome fragments, as chromosome pulverization 
was previously correlated with the presence of micronuclei in cancer 
cell lines after DNA damage***. Although chromothripsis seems to be 
relatively rare, DNA damage in micronuclei could be more common. 
We propose that chromothripsis may be an extreme outcome of a 
mutagenesis mechanism that could be widespread in human cancer. 


METHODS SUMMARY 


All cell lines were maintained at 37 °C with 5% CO, atmosphere. To generate micro- 
nuclei, cells were treated with nocodazole (100 ng ml ') for 6h followed by mitotic 
shake-off and nocodazole washout. Alternatively, micronuclei were generated by 
depletion of MCAK or NUF2 by short interfering RNA (siRNA). Micronuclei were 
also generated in HT1080 cells by inactivation of the kinetochore of the human 
artificial chromosome by washout of doxycycline. Detailed descriptions of 
immunofluorescence, RNAi, BrdU labelling, comet assays, inactivation of the 
human artificial chromosomes (HAC), flow cytometry, quantitative analysis of 
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nuclear import, chromosome spreads, spectral karyotyping, antibodies used, 
photoconversion of H2B-Kaede and long-term live-cell imaging experiments 
can be found in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culture. All cell lines were maintained at 37° C with 5% CO, atmosphere in 
DMEM (U208S, HT1080), DMEM:F12 (hTERT RPE-l), or DMEM:F12 medium 
without phenol red (H2B-GFP and H2B-mRFP U208, H2B-GFP and H2B- 
mREP RPE-1, H2B-GFP Caco2, H2B-GFP SCC-114, H2B-GFP HeLa, H2B- 
mRFP MDA-231). All media were supplemented with 10% FBS, 100IU ml! 
penicillin and 100 pg ml streptomycin. HT1080 cells bearing the alphoid'"° 
human artificial chromosome and expressing the tetracycline repressor TetR 
fused to the transcriptional silencer tTS (a gift from V. Larionov) were also 
supplemented with 1 jig ml” ' doxycycline (Sigma), 0.5 mg ml ' G418 (geneticin, 
Invitrogen) and 4 jig ml" blasticidin S (Invitrogen). U2OS cells stably expressing 
human NFATcI fused to the C terminus of EGFP (Thermo Scientific; R04-017-01) 
were supplemented with 0.5 mg ml’ G418. 

Generation of whole-chromosome micronuclei. To generate cells with whole- 
chromosome micronuclei, hTERT RPE-1 and U20S cells were treated with 100 ng 
ml ' nocodazole (Sigma) for 6h. Mitotic cells were collected, washed twice with 
fresh medium containing 10% FBS, and then plated into medium containing 20% 
FBS where they completed cell division. To overcome the aneuploidy-induced 
p53-dependent G1 arrest associated with nocodazole washout experiments, 
RPE-1 cells were transfected with 50 nmol p53 siRNA (Smartpool, Dharmacon) 
using Lipofectamine RNAi Max (Invitrogen) 12h before the nocodazole treat- 
ment, enabling cell-cycle progression of the subsequent aneuploid daughter cells 
with micronuclei. For NUF2 and MCAK knockdown, U20S cells were serum 
starved for 2 days before transfection with 50nmol NUF2 or MCAK siRNAs 
(Smartpool, Dharmacon). Cells were then washed in fresh medium containing 
10% FBS and 24h later treated with 4 1M dihydrocytochalasin B (DCB; Sigma) for 
16h to inhibit cytokinesis. The subsequent arrested tetraploid cells depleted of 
NUE2 or MCAK (many of which harboured micronuclei) were then treated for 6h 
with 10 1M SB203580 (Sigma), a p38 inhibitor that promotes cell-cycle progres- 
sion of tetraploids. Sequence information of the small interference RNA (siRNA) 
pools used from Dharmacon are as follows: human TP53 ON-TARGETplus 
SMARTpool siRNA L-003329-00-0005, (J-003329-14) 5’-GAAAUUUGCGUGUG 
GAGUA-3’, (J-003329-15) 5’-GUGCAGCUGUGGGUUGAUU-3’, (J-003329- 16) 
5'-GCAGUCAGAUCCUAGCGUC-3’, (J-003329-17) 5’-GGAGAAUAUUUCA 
CCCUUC-3'; human KIF2C/MCAK ON-TARGETplus SMARTpool siRNA 
L-004955-00-0005, (J-004955-06) 5’-GGCAUAAGCUCCUGUGAAU-3’, (J- 
004955-07) 5’-CCAACGCAGUAAUGGUUUA-3’, (J-004955-08) 5’-GCAAG 
CAACAGGUGCAAGU-3’, (J-004955-09) 5’-UGACUGAUCCUAUCGAAGA-3’; 
human CDCA1/NUF2 ON-TARGETplus SMART pool siRNA L-005289-00-0005, 
J-005289-06 5’'-GAACGAGUAACCACAAUUA-3’, (J-005289-07) 5'-UAGCUG 
AGAUUGUGAUUCA-3’, (J-005289-08) 5'-GGAUUGCAAUAAAGUUCAA-3’, 
(J-005289-09) 5'-AAACGAUAGUGCUGCAAGA-3’. 

Generation of micronuclei containing the HAC. HT1080 cells carrying the 
HAC were cultured in doxycycline-free medium and treated with 100 ngml ' 
nocodazole for 6h to synchronize cells in mitosis. The absence of doxycycline 
enables TetR binding to TetO, which induces inactivation of the HAC centromere 
by tTS and the subsequent formation of a micronucleus after release from mitotic 
arrest. Cells released from mitotic arrest were synchronized at G0/G1 by serum 
starvation for 24h, then released into the cell cycle with fresh medium containing 
10% serum for further analysis. 

Flow cytometry. Cells were fixed with 70% ethanol at 4 °C followed by incubation 
with 250 1g ml~' RNase A and 10 jg ml! propidium iodide at 37 °C for 30 min. 
FACS analysis was performed with a FACScalibur flow cytometer (Becton 
Dickinson) and data analysed with CellQuest software. 

Indirect immunofluorescence microscopy. For most experiments, cells were 
seeded on glass coverslips, washed in CSK buffer (10mM PIPES, 100mM 
NaCl, 300 mM sucrose, 3 mM MgCl), pre-extracted with CSK buffer containing 
0.5% Triton X-100 for 5min at 4°C, and fixed in PBS containing 4% 
paraformaldehyde for 15 min. After fixation, cells were permeabilized in PBS- 
0.2% Triton X-100 for 5 min, blocked in blocking buffer (PBS containing 5% FBS, 
2% BSA and 0.1% Triton X-100) for 30 min, and then incubated with primary 
antibodies at room temperature for 1h or overnight at 4°C. Cells were washed 
with PBS-0.1% Triton X-100 and incubated with fluorescence-conjugated 
secondary antibodies (1:1,000, Molecular Probes) at room temperature for 1h. 
Cells were also stained for DNA with Hoechst 33342 (1:5,000, Invitrogen) in PBS. 
More specific immunostaining protocols were as follows: for y-H2AX and other 
DNA damage-response proteins, as in ref. 46; for ELYS and mAb414, ref. 47; for 
cyclin B1/y-H2AX/BrdU, ref. 48, and for MCM2, CDT1 and ORC2, ref. 49. 
Images for most experiments were collected with a Yokogawa CSU-22 spinning 
disk confocal mounted on a Zeiss Axiovert microscope using 488, 561 and 640 nm 
laser light. A series of 0.3 j1m optical sections was acquired using a 100 1.4 NA 
Plan Apo objective with an Orca ER CCD camera (Hamamatsu Photonics). For 
nuclear pore quantification, images were collected on a Leica SP5 laser scanning 
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confocal microscope with a 405 nm laser and a white light laser tuned to 488 nm, 
568nm and 647nm using a X63 1.4 NA Plan Apo objective. Z-stacks were 
collected with a 0.3 j1m step size with pinhole at 1 Airy unit. Photomultipliers 
were calibrated to ensure linear range. For quantification of MCM2 and BrdU 
signals, mean fluorescence intensity within the primary nuclei or micronuclei was 
measured using Image] software. For quantification of y-H2AX, primary nuclei 
were classified as positive when five or more foci were detected, with the threshold 
of foci intensity set against the irradiated sample. The same threshold was applied 
to score y-H2AX foci in micronuclei. Acquisition parameters, shutters, filter 
positions and focus were controlled by Slidebook software (Intelligent Imaging 
Innovations). Images presented in figures are maximum intensity projections of 
entire z-stacks, unless otherwise stated. 

Antibodies for immunofluorescence. Samples were incubated with primary 
antibodies for human anti-centromere ACA (1:1,000; Antibodies, Inc.), rabbit 
y-H2AX-Ser 139 (1:500, Cell Signaling), mouse y-H2AX (clone JBW301, 1:500, 
Upstate), rabbit 53BP1 (1:500, Cell Signaling), rabbit phospho-53BP1-Ser 1778 
(1:500, Cell Signaling), rabbit phospho-CHK1-Ser 317 (1:500, Cell Signaling), 
rabbit phospho-ATM-Ser 1981 (1:500, Cell Signaling), rabbit ATR (1:500, Cell 
Signaling), rabbit MRE11 (1:300, Abcam), rabbit RAD51 (1:100, Santa Cruz), 
mouse BRCA1 (1:300, Santa Cruz), mouse RPA2 (1:500; Abcam), rabbit phospho- 
RPA2-S33 (1:500, Bethyl Laboratories), MCM2 and ORC2 (1:300, gifts from B. 
Stillman), rabbit CDT1 (1:500, Bethyl Laboratories), rabbit geminin (1:500; Santa 
Cruz), mouse monoclonal antibody 414 (1:5,000, Abcam), ELYS/MEL28 (1:1,000, 
gift from I. Mattaj), mouse lamin A/C (1:300, Abcam), rabbit cyclin A (1:300; Santa 
Cruz), rabbit cyclin B1 (1:500; Santa Cruz), rabbit cyclin D1 (1:300; Santa Cruz) 
and mouse BrdU (Sigma). Secondary antibodies used were Alexa Fluor 488 
(green), 594 (red) and 647 (far red) from Molecular Probes. 

TUNEL assay. TUNEL assay was performed according to the manufacturer's 
instructions (In situ cell death detection kit, Roche). 

BrdU labelling. Cells were pre-labelled with 10 1M BrdU for 30 min and subse- 
quently fixed and permeabilized according to the manufacturers’ conditions 
(5-bromo-2-deoxyuridine labelling and detection kit 1, Roche). Cells were then 
stained with anti-BrdU (1h, diluted 1:300) and processed for immunofluores- 
cence. In the box and whiskers plot, the box represents upper and lower quartiles, 
line within box represents median and the whiskers extend to the highest and 
lowest value data sets. 

Long-term live-cell imaging. Labelled cells were grown on glass-bottom 12-well 
tissue culture dishes (Mattek) and imaged on either a Nikon TE2000-E2 or Nikon 
Ti-E inverted microscope. Both microscopes were equipped with a cooled CCD 
camera (TE2000, Orca ER, Hamamatsu; Ti-E, Coolsnap HQ2, Photometrics), a 
precision motorized stage (Bioprecision, Ludl), and Nikon Perfect Focus, and both 
were enclosed within temperature and CO;-controlled environments that main- 
tained an atmosphere of 37 °C and 3-5% humidified CO. GFP or RFP images were 
captured at multiple points every 5-10 min for 3-5 days with either X10 (0.3 NA) 
or X20 (0.5, 0.75 NA) objectives, and exposure to fluorescent light was minimized 
to the greatest extent possible (all image acquisition used neutral density filters and/ 
or 2 X 2 binning). Cell viability was confirmed by the continuous observation of 
mitotic cells throughout the duration of experiments. Captured images from each 
experiment were analysed using NIS-Elements software. 

Imaging of H2B-Kaede. Tandem repeats of Kaede (a gift of A. Miyawaki) were 
fused to H2B and cloned into the pLenti6/V5 lentiviral vector (Invitrogen). A U2OS 
cell line stably expressing H2B-Kaede was generated by lentiviral infection followed 
by repeated FACS sorting of GFP* cells. For tracking experiments, pre-converted 
micronuclei (GEP *) were identified and photoconverted with a brief (1-3 s) pulse 
of 350-400 nm ultraviolet light from a DAPI filter cube. Micronuclei that efficiently 
converted (RFP*) were subsequently imaged through the completion of the fol- 
lowing mitosis using the same microscope setup as described above. Images were 
acquired every 10 min with a X20 objective and 2 X 2 binning. 

Chromosome spreads/spectral karyotyping. RPE-1 cells were treated with p53 
siRNA (as previously described) for 16 h and then grown in medium supplemented 
with nocodazole (100 ng ml ) or vehicle control (DMSO) for 6 h. Mitotic cells were 
collected by shake-off, washed thoroughly in fresh medium without nocodazole, and 
re-plated in culture medium with 20% FBS to complete mitosis. FACS analysis 
demonstrated that the subsequent daughter cells progressed to S/G2-phase 
18-20h after release. At this point, interphase samples were collected and fixed 
to observe the fraction of cells containing micronuclei. In parallel, 100 ng ml 
colcemid (Invitrogen) was added to the DMSO or nocodazole-treated cells for an 
additional 6h and then fixed. All samples were similarly processed for fixation: 
cells were pelleted and re-suspended in a hypotonic solution of 0.075 M KCl for 
18 min, fixed in Carnoy’s fixative (3:1 methanol:glacial acetic acid) and washed 
four times with Carnoy’s fixative. All fixed samples were spread on slides for 
staining or hybridization. Mitotic samples were stained with Giemsa or Hoechst 
to visualize chromosomal abnormalities. Spectral karyotyping (SKY) was performed 
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on interphase and mitotic samples (from both DMSO- and nocodazole-treated 
samples) according to the SkyPaint DNA kit H-5 for human chromosomes pro- 
cedure (Applied Spectral Imaging, SKY000029) and imaged on a Nikon Eclipse 
E6000 microscope equipped with the SD300 Spectracube and Spectral Imaging 
acquisition software. To determine if pulverized chromosomes were derived from 
late-replicating chromosomes in micronuclei, the above procedure was also per- 
formed with addition of BrdU labelling reagent (Roche Applied Science) only in 
the last 2 h of colcemid treatment before fixation. Mitotic samples from the BrdU 
samples were treated according to the 5-bromo-2'-deoxy-uridine labelling and 
detection kit I protocol (Roche Applied Science, 11296736001), except that the 
anti-BrdU antibody was diluted 1:50. 

Single-cell gel electrophoresis (comet) assay. Single-cell comet assays were per- 
formed according to the manufacturer’s instructions (Trevigen). Briefly, U2OS 
cells were collected at G1, S and G2 phases of the cell cycle after nocodazole 
washout was used to induce micronuclei. As controls, G1 cells were irradiated 
at 10 Gy. Cells were re-suspended in cold PBS at 2 x 10° cells ml’, mixed with 
low-melt agarose (1:10 ratio) and spread on frosted glass slides. After the agarose 
solidified, the slides were sequentially placed in lysis and alkaline solutions 
(Trevigen). Slides were then subjected to electrophoresis at 12 V for 10 min in 
1X TBE buffer, fixed with 70% ethanol, and stained with DAPI. Nuclei were 
visualized using epifluorescent illumination on a Zeiss microscope and images 
were analysed with the NIH Image J program. DNA damage was quantified for 50 
cells with micronuclei for each experimental condition by determining the tail 
DNA percentage using Comet Score (TriTek) software. 

Quantitative analysis of nuclear pore complexes. To determine nuclear pore 
complex density in primary nuclei and micronuclei, cells were pre-extracted, fixed, 
permeabilized and immunostained with monoclonal 414 and ELYS antibodies. 
Confocal images were collected on a Leica SP5 laser scanning confocal with both a 
405-nm and white light laser (at 488nm and 568nm) using a X63 Plan Apo 1.4 NA 
oil objective. Z-stacks were collected with a 0.3 jum step size with pinhole at 1 Airy 
unit. Photomultipliers were calibrated to ensure linear range. Single layers of the 
Z-stack showing optimal nuclear pore staining for the primary nucleus and the 
micronuclei were chosen for quantitative analysis using MetaMorph. The areas of 
primary nuclei and micronuclei were judged from Hoechst staining and used to 


determine the perinuclear rim area. Total fluorescence intensity of monoclonal 
antibody 414 and ELYS was measured in the perinuclear rim area. To compare 
nuclear pore densities between primary nuclei and micronuclei, the total nuclear 
pore fluorescence was divided by the perinuclear rim area of primary nuclei and 
micronuclei, resulting in an average intensity ratio. 

Nuclear import. To measure nuclear import, we used U2OS cells stably expres- 
sing the fusion protein NFATc1-EGFP (Thermo-Fisher Scientific). The inactive 
transcription factor NFATcl1 resides in the cytosol. Elevated calcium levels in the 
cytosol lead to the dephosphorylation of NFATc1 by calcineurin, thereby indu- 
cing its rapid translocation into the nucleus. To increase the calcium level in the 
cytoplasm, U2OS cells released from nocodazole washout were treated with 
0.2M thapsigargin, an inhibitor of the sarco-endoplasmic reticulum Ca** 
ATPase. Nuclear import of NFATc1 was then measured in the primary nuclei 
and micronuclei of U2OS cells at both 6 h and 20 h after nocodazole release. Cells 
were subsequently fixed with 2% paraformaldehyde and import was quantified 
based on nuclear fluorescence. To measure nuclear import of IBB-DiHcRed*, 
U20S cells were transiently transfected with the IBB-DiHcRed-plasmid (gift 
from J. Ellenberg) using Lipofectamine 2000 (Invitrogen). Cells were then syn- 
chronized for 10 h with nocodazole, washed as previously described, and released 
into the cell cycle. Cells were fixed with 2% paraformaldehyde both 6h and 20h 
later, and import was quantified based on nuclear fluorescence. 
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prostate or blood cancers — also display 
epigenetic changes that are crucial for tumour 
development and maintenance. Future experi- 
ments, including integrated genomic analyses, 
may provide the answer. 

The underlying mechanism of alterations 
in gene expression by epigenetic means in 
retinoblastoma is still unclear. But Zhang and 
co-workers’ observation’ that 13% of these 
tumours show recurrent mutations in the 
BCOR gene offers a possible explanation. The 
BCOR protein is highly expressed in the fetal 
retina and is essential for eye development, 
as evidenced by a congenital eye disorder, syn- 
dromic microphthalmia, which results from a 
heritable mutation in BCOR’. BCOR associates 
with proteins that repress gene expression 
epigenetically*. This raises the possibility 
that loss of BCOR function, due to acquired 
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mutations in its gene, may contribute to an 
altered epigenetic landscape in RB1-deficient 
retinal cells. Consequently, inappropriate 
expression of crucial genes may impair cell 
maturation and so facilitate the progression of 
retinoblastoma. It will be of interest to deter- 
mine whether BCOR is part of the repression 
machinery that silences SYK expression in the 
normal retina, and whether acquired BCOR 
mutations provide at least one route to altered 
expression of SYK in retinoblastoma. 

Nevertheless, the current work — using a 
comprehensive, integrated genomics approach 
— is notable not just for demonstrating that 
epigenetic alterations have a predominant role 
in the progression of retinoblastoma. The new 
possibilities it raises for therapies in this child- 
hood malignancy, and possibly in other types 
of tumour, are equally noteworthy. = 


A ratchet for protein 


complexity 


Molecular machines containing related protein subunits are common in cells. 
Reconstruction of ancient proteins suggests that this type of complexity can 
evolve in the absence of any initial selective advantage. SEE LETTER P.360 
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rganisms and cells are bewilderingly 
complicated, and the molecular 
machines that perform many basic 
cellular functions are often giant, multi- 
subunit, multifunctional protein complexes 
with tangled evolutionary histories. It is 
generally assumed that such complexes 
arose by the stepwise accretion of indi- 
vidual proteins, each addition represent- 
ing a selective advantage by adding to or 
refining the machine’s performance. But 
on page 360 of this issue, Thornton and 
colleagues’ argue against this standard 
explanation in one particular instance — 
that of a ring-shaped protein complex in 
fungi. The authors show how evolutionary 
processes entailing loss of function rather than 
gain might initially drive a system towards 
complexity, independently of selection*. 
Many cellular molecular machines con- 
tain several proteins that self-assemble into a 
multi-subunit ring. In simple cases, rings are 
homo-oligomeric; that is, all the subunits 
are identical and thus probably the products 
of a single gene. In more complex (hetero- 
oligomeric) examples, the protein molecules 
in the ring are different, but may be related. 


“This article and the paper? under discussion were 
published online on 8 January 2012. 


Often, hetero-oligomeric rings seem to have 
arisen from homo-oligomeric complexes 
after a gene encoding a single subunit became 
duplicated, producing two or more gene 
copies (called paralogues), with each copy 
subsequently evolving to encode a slightly 
different protein. 
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The question that Thornton and col- 
leagues address is, why does this happen? 
Do the structurally distinguishable subunits 
and/or their specific pattern of assembly confer 
improved or additional functions on the pro- 
tein complex, with selection for their enhanced 
performance being the evolutionary driving 
force? Alternatively, might neutral processes 
be responsible, at least initially? 

To understand how the latter possibility 
might come about, imagine that a gene encod- 
ing, for example, eight subunits of a homo- 
oligomeric ring undergoes duplication (Fig. 1). 
The two paralogous gene copies that this dupli- 
cation produces — and the protein subunits 
they encode (call them A and B) — will of 
necessity begin to diverge through accumu- 
lation of neutral mutations. These mutations 
might not at first affect any of the subunits’ 
functions, so that functional octameric rings 
could continue to form bya random sampling 


Figure 1 | Evolution of complexity in a protein ring. a, Homo-oligomeric protein complexes contain 
identical subunits (A) that are typically encoded by a single gene. b, If the gene is duplicated, the two gene 
copies can diverge through the accumulation of neutral mutations, generating structurally distinguishable 
but functionally unaltered subunits A and B, which can form functional hetero-oligomeric rings by 
random mixture. c, If additional mutations prevent subunits from binding to others of their own type, 
functional rings could still be formed by alternating subunit types. As further mutations accumulate, the 
probability of returning to the initial homo-oligomeric situation becomes very small. Thus, the subunit 
composition of a protein association may become complex in the absence of initial selective advantages. 
Thornton and colleagues’ provide experimental evidence suggesting that this type of process has occurred 
in the evolution of a six-membered protein ring that forms part of the vacuolar H’-ATPase enzymes in fungi. 
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of A and B subunits. But further mutations 
might result in each subunit losing the ability 
to bind others ofits own type, A to A or B to B. 
However, as long as A still binds B and B still 
binds A, functional rings can form. 

As A and B continue to co-evolve through 
the acquisition of further neutral mutations, the 
likelihood of reversing back along the muta- 
tional path to the initial single-gene homo- 
oligomeric complex becomes vanishingly 
small. We can thus see the homo-oligomer to 
hetero-oligomer transition as an inexorable 
evolutionary ratchet. And even if there were 
some slight selective disadvantage to using 
a hetero-oligomeric complex (for example, 
complex formation could become slower), this 
could be overcome in small populations — in 
which the efficiency of selection is reduced 
and mildly harmful mutations can be fixed 
by chance’. 

Some chaperonins — molecular machines 
that assist other proteins to fold into functional 
configurations — have until recently pro- 
vided the best evidence for such a scenario. 
Microorganisms known as archaea have 
one, two or three paralogous genes encoding 
chaperonin subunits that form rings: homo- 
oligomeric eight-membered rings when 
there is a single gene, or hetero-oligomeric 
eight- or nine-membered rings when there 
are two or three genes. Such paralogy seems to 
reoccur frequently in archaea, and a neutral 
co-evolutionary picture of the sort described 
above has been suggested’ to explain it. 

Comparative bioinformatic studies* sup- 
port the idea that hetero-oligomeric archaeal 
chaperonins consisting of two different sub- 
units are not functionally distinct from the 
homo-oligomers, and thus did arise neutrally. 
Adaptive changes can follow later, however. 
Indeed, there is direct evidence for specialized 
roles for the eight distinct chaperonin subunits 
of eukaryotes (organisms such as plants, 
animals and fungi)”. 

So a more realistic picture might be as 
follows. Rarely, gene duplication is quickly 
followed by functional differentiation (as pro- 
posed in early formulations of the hypothesis 
of evolution by gene duplication’). Selection 
against loss of either duplicate would then be 
ensured early on. Perhaps more often, though, 
the paralogous condition might first be locked 
in by the above-described co-evolutionary 
ratchet, allowing functional differences (if 
any) to be acquired later (if ever). That is what 
Thornton and colleagues’ argue has happened 
in the case ofa six-membered hetero-oligomeric 
ring that forms part of the enzymes known as 
vacuolar H*-ATPases (V-ATPases) in fungi. 
Their argument is especially compelling 
because it rests not only on bioinformatics, but 
also on experiments, including the synthesis 
and testing of proteins that the authors propose 
are ancestral to the V-ATPase subunits. 

In most eukaryotes, the V-ATPase six- 
membered ring is made up of five copies of the 


protein Vma3 and one of its paralogue Vmal6. 
In fungi such as budding yeast, by contrast, one 
of the five Vma3 proteins is replaced by Vma11, 
which is a fungus-specific paralogue of Vma3. 
The abilities of these subunits to bind to one 
another’ are such that Vma11 must lie between 
Vmal6 and Vma3 in the ring, and Vma3 cannot 
form the interface with the side of Vma16 that 
Vmal11 can. This represents a sort of functional 
degeneration, because the last common ances- 
tor of Vma3 and Vma11 must have been able 
to interface with either side of Vmal6, as Vma3 
must still do in organisms other than fungi. 

Thornton and colleagues’ synthesized 
proteins that matched the phylogenetically 
inferred ancestor of Vma3 and Vma11, as well 
as that of Vma16 and other intermediates in 
the predicted evolutionary pathway. They 
demonstrate that Anc.3-11, the inferred 
ancestor of fungal Vma3 and Vma11, does 
indeed interface with either side of Vmal6. 
The authors tested the functionality of these 
and other inferred evolutionary ancestors and 
intermediates by expressing them in yeast 
mutants that lacked current versions of 
one or more of the three genes. According 
to this picture, when Anc.3-11 underwent 
duplication, one paralogue (which was to 
become modern fungal Vma3) lost the ability 
to interact with the ‘anticlockwise’ side of 
Vma16. The other paralogue (which became 
Vmal1) lost the ability to interact with 
the ‘clockwise’ side of Vmal6 and/or the 
anticlockwise side of Vma3. 

A general theory, sometimes called construc- 
tive neutral evolution, to explain how neutral 
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processes might drive a system towards com- 
plexity is more than a decade old*””. But the 
study by Thornton and colleagues’ may provide 
the most compelling experimental evidence to 
date. Of course, one can never prove that some 
subtle, unidentified selective advantage was 
not involved in the evolution of the V-ATPase 
protein ring, but neutrality would seem the 
most justifiable default hypothesis. Thus, a 
neutral theory of molecular evolution, normally 
invoked for nucleotide substitutions, may also 
apply to certain higher-order structures such 
as multi-subunit protein rings. How general 
such neutral mutational drives to complexity 
might be is one of evolutionary theory's deeper 
unanswered questions. m 
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Distant dwarfs 


Astronomers value the smallest galaxies for the clues they can provide about how 
galaxies form. But only those nearest to us are easily detectable. A neat technique 
has allowed one such object to be found at a large distance. SEE LETTER P.341 


ROBERT W. SCHMIDT 


ome objects do not shine brightly enough 

to be seen easily. Examples of this are the 

small and extremely faint galaxies called 
dwarf or satellite galaxies. Only when they are 
near to our Galaxy might we be lucky enough 
to detect the light of their stars. Satellite galax- 
ies are of great interest because simulations of 
the process of galaxy formation predict that 
there are many more satellites in the Local 
Group, which includes the Milky Way and 
other galaxies in its vicinity, than have yet been 
found'”. This begs the question of whether 
there is something wrong with our under- 
standing of how galaxies form. On page 341 of 
this issue, Vegetti et al.* report the discovery of 


a satellite in the vicinity ofa galaxy located at a 
cosmological distance from Earth that allows 
them to test the simulation predictions for gal- 
axies beyond those of the Local Group. 

The object studied by Vegetti et al. is a distant 
elliptical galaxy that acts as a gravitational lens: 
light from a background source is deflected 
from its otherwise straight path by a massive 
foreground object, the lens. When compared 
with camera lenses, gravitational lenses have 
rather dismal optical properties because they 
can lead to large distortions and multiple 
images of the background object. But from an 
astronomer’s point of view this is a good thing: 
the mass of the gravitational lens can be meas- 
ured by reconstructing the background object 
from the distorted images. 
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